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FIGURE 8 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCC£ISGGGGCCTGCCTGGGA 

GCCTGCTCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAG 

CTGCTGCCCCGCCAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCC 

TGGGGGTGCTGGTGTCCATCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTG 

CCCTGGGTGTGTGAGGAGGGGGCCGGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGG 

CTCCCTGCTTGGCTACCGCGCTGTCTACCGCATGTGCTTCGCCACGGCGGCCTTCTTCTTCT 

TCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCAGCCGGGACCCCCGGGCTGCCATCCAG 

AATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTCACCGTGGGTGCCTTCTACAT 

CCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGGCTCCTTCCTCTTCA 

TCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGTGGCTGGGC 

AAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTCTT 

CTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCT 

GCCACGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCT 

GCTGTCCTGCCCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCAT 

CACCCTCTACACCATGTTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCA 

AC C C CCATTTG C CAAC G CAGCTGGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAG 

ACCCAGTGGTGGGATGCCCCGAGCATTGTGGGCCTCATCATCTTCCTCCTGTGCACCCTCTT 

CATCAGTCTGCGCTCCTCAGACCACCGGCAGGTGAACAGCCTGATGCAGACCGAGGAGTGCC 

CACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGTGGCAGCCTGTGAGGGCCGGGCC 

TTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCCACTTCTGCCTGGTGCT 

GGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAGACCCGGAAGA 

TGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCTCCTC 

TACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCT^GGCAGCCT 

CACAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCT 

GCCCCCTCCCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTG 

CCCCTGAGCCGGGCCTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCA 

GAGCCCCATCCCCCCGCCACACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGT 

TGCCCATACTCAGCATCTCGGATGAAAGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGG 

CTGCTGGAGAGAGCGGGGAACTCCCACCACAGTGGGGCATCCGGCACTGAAGCCCTGGTGTT 

CCTGGTCACGTGCCCCAGGGGACCCTGCCCCCTTCCTGGACTTCGTGCCTTACTGAGTCTCT 
AAGAGTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGU RE 9 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVE 
SQLYKLPWVCEEGAG I FTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRD 
PRAAIQNGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSW 
NQRWLGKAEECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTSPSGCHEGKVFISLNLTFC 
VCVSIAAVLPKVQDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWA 
GPEGYETQWWDAPS IVGLI IFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVA 
ACEGRAFDNEQDGVT YS YS FFHFCLVIASLHVMMTLTNWYKPGETRKM STWTAVWVKI CAS 
WAGLLLYLWTLVAPLLLRNRDFS 
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FIG UR E 1 0 

GAGCGAGGCCGGGGACTGAAGGTGTGGGTGTCGAGCCCTCTGGCAGAGGGTTAACCTGGGTC 
AAATGCACGGATTCTCACCTCGTACAGTTACGCTCTCCCGCGGCACGTCCGCGAGGACTTGA 
AGTCCTGAGCGCTCAAGTTTGTCCGTAGGTCGAGAGAAGGCC^TSGAGGTGCCGCCACCGGC 
ACCGCGGAGCTTTCTCTGTAGAGCATTGTGCCTATTTCCCCGAGTCTTTGCTGCCGAAGCTG 
TGACTGGCGATTCGGAAGT CCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCC 
TATTACCCGGAATCTGGATGGGACCGCCTCCGGGAGCTGTTTGGCAAAGATGAAGAGCAGAG 
AATTTCAAAGGAGCTTGCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGG 
TGTAT G G GGG AATAC CAG CTTTTATT CATGCTAAACAACAAT ACATTGAGCAGAGCCAGGCA 
GAAATTTATCATAACCGGTTTGATGCTGTGCAATCTGCACATCGTGCTGCCACACGAGGCTT 
CATTCGTTATGGCTGGCGCTGGGGTTGGAGAACTGCAGTGTTTGTGACTATATTCAACACAG 
TGAACACTAGTCTGAATGTATACCGAAATAAAGATGCCTTAAGCCATTTTGTAATTGCAGGA 
GCTGTCACGGGAAGTCTTTTTAGGATAAACGTAGGCCTGCGTGGCCTGGTGGCTGGTGGCAT 
AATTGGAGCCTTGCTGGGCACTCCTGTAGGAGGCCTGCTGATGGCATTTCAGAAGTACGCTG 
GTG AGAC TGTT CAGGAAAG AAAACAGAAGGATCGAAAGGC ACT C CATGAGCTAAAACTGGAA 
GAGTGGAAAGGCAGACTACAAGTTACTGAGCACCTCCCTGAGAAAATTGAAAGTAGTTTACG 
GG AAG ATGAAC CTG AG AATG ATGCTAAGAAAATTGAAG CACTG CTAAACCTT CCTAGAAAC C 
CTTCAGTAATAGATAAACAAGACAAGGAC2S&AAGTGCTCTGAACTTGAAACTCACTGGAGA 
GCTGAAGGGAGCTGCCATGTCCGATGAATGCCAACAGACAGGCCACTCTTTGGTCAGCCTGC 
TGACAAATTTAAGTGCTGGTACCTGTGGTGGCAGTGGCTTGCTCTTGTCTTTTTCTTTTCTT 
TTTAACTAAGAATGGGGCTGTTGTACTCTCACTTTACTTATCCTTAAATTTAAATACATACT 
TATGTTTGTATTAATCTATCAATATATGCATACATGGATATAT CCAC CCACCTAGATTTTAA 
GCAGTAAATAAAACATTTCGCAAAAGATTAAAGTTGAATTTTACAGTTT 
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FIGURE 11 

></usr/seqdb2/sst:/DNA/Dnaseqs,min/ss .DNA23318 
xsubunit 1 of 1 , 285 aa, 1 stop 
><MW: 32190, pi: 9.03, NX(S/T): 2 

MSVPPPAPRSFLCRALCLFPRVFAAEAVTADSEVLEERQKRLPT7PEPYYPESGWDRLRELF 
GKDEQQRISKDLANICKTAATAGIIGWVYGGIPAFIHAKQQYIEQSQAEIYHNRFDAVQSAH 
RAATRGFIRYGWRWGWRTAVFVTIFNTVOTSLNVYR^ 

GLVAGG 1 1 GALLGTPVGGLLMAFQKYAGETVQERKQKDRKALHELKXjEEWKGRLQVTEKLPE 
KIESSLREDEPENDAKKIEALLNLPRNPSVIDKQDKD 

Important Features: 
Signal Peptide: 

amino acids 1-2 4 

Transmembrane domains : 

amino acids 7 6-96 and amino acids 171-195 

N-glycosylation site: 

amino acids 153-156 
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FIGURE 12 

CGGAAGTCCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCCTATTACCCGGA 
ATCTGGATGGGACCGCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAGAATTTCAAAGGA 
CCTTGCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGGTGTATGGGGGAA 
TACCAGCTTTTATTCATGCTAAACAACAATACATTGAGCAGAGCCAGGCAGAAATTTATCAT 
AACCGGTTTGATGCTGTGCAATCTGCACATCGTGCTGCCACACGAGGCTTCATTCGTTCATG 
GCTGGCGCCGAACC 
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FIGURE 13 

TCAAGTTTGTCCGTAGGTCGAGAGAAGGCCATGGAGGTGCCGCCACCGGCACCGCGGAGCTT 

TTTTCTGTAGAGCATTGTGCCTATTTCCCCGAGTTTTTGCTGCCGAAGCTGTGACTGCCGAT 

TCGGAAGTCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCCTATTACCCGGA 

ATTTGGATGGGACCGCCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAGAATTTCAAAGG 

ACCTTGCTGATATNTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGGTGTATGGGGGA 

ATACCAGCTTTTATTCATGNTAAACAACAATACATTGAGCAGAGCCAGGCAGAAATTTATNA 
TAACC 
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FIG U RE 14 



GAGCCGCCGCCGCGCGCGCGCCGCGOVCTGCAGCCCCAGGCCCCGGCCCCCCACCCACQTCT 
GCGTTGCTGCCCCGCCTGGGCCAGGCCCCAAAGGCAAGGACAAAGCAGCTGTCAGGGAACCT 
CCGCCGGAGTCGAATTTACGTGCAGCTGCCGGCAACCACAGGTTCCAAGAISGTTTGCGGGG 
GCTTCGCGTGTTCCAAGAACTGCCTGTGCGCCCTCAACCTGCTTTACACCTTGGTTAGTCTG 
CTGCTAATTGGAATTGCTGCGTGGGGCATTGGCTTCGGGCTGATTTCCAGTCTCCGAGTGGT 
CGGCGTGGTGATTGCAGTGGGCATCTTCTTGTTCCTGATTGCTTTAGTGGGTCTGATTGGAG 
CTGTAAAACATCATCAGGTGTTGCTATTTTTTTATATGATTATTCTGTTACTTGTATTTATT 
GTTCAGTTTTQTGTATCTTGCGCTTGTTTAGCCCTGAACCAGGAGCAACAGGGTCAGCTTCT 
GGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATCCAGAGAAATCTAAACTGCT 
GTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTCTGGCTAGCTGTGTTAAAAGTGACCAC 
TCGTGCTCGCCATGTGCTCCAATCATAGGAGAATATGCTGGAGAGGTTTTGAGATTTGTTGG 
TGGCATTGGC CTGTTCTT CAGTTTTACAGAGAT CCTGGGTGTTTGGCTGACCTACAGATACA 
GGAAC CAGAAAGACC C C.CGCGCGAATCCTAGTGCATTCCTTIS&TGAGAAAACAAGGAAGAT 
TTCCTTTCGTATTATGATCTTGTTCACTTTCTGTAATTTTCTGTTAAGCTCCATTTGCCAGT 
TTAAGGAAGGAAAC ACTAT CTGGAAAAGT AC CTTATTGAT AGTGGAATT ATATATTTTTACT 
CTATGTTTCTCTACATGTTTTTTTCTTTCCGTTGCTGAAAAATATTTGAAACTTGTGGTCTC 
TGAAGCTCGGTGGCACCTGGAATTTACTGTATTCATTGTCGGGCACTGTCCACTGTGGCCTT 
TCTTAGCATTTTTACCTGCAGAAAAACTTTGTATGGTACCACTGTGTTGGTTATATGGTGAA 
T CTGAACGT ACAT CT CACTG GTATAATTATATGTAGCACTGTG CTGTGTAGATAGTT C CTAC 
TGGAAAAAGAGTGGAAATTTATTAAAATCAGAAAGTATGAGATCCTGTTATGTTAAGGGAAA 
TCCAAATTC C CAATTTTTTTTGGTCTTTTTAGGAAAGATTGTTGTGGTAAAAAGTGTTAGTA 
TAAAAATGATAATTTACTTGTAGTCTTTTATGATTACACCAATGTATTCTAGAAATAGTTAT 
GTCTTAGGAAATTGTGGTTTAATTTTTGACTTTTACAGGTAAGTGCAAAGGAGAAGTGGTTT 
C ATG AAAT GTT CTAATGT AT AATAACATTTAC CTT CAGC CT C CAT C AG AATGGAACGAGTTT 
TG AGT AAT C AGG AAGT AT AT CTATATG AT CTTG ATATTGTTTTATAATAATTTG AAGT CT AA 
AAG ACTG CATTTTTAAAC AAGTTAGTATTAATGCGTTGG C CCACGTAG CAAAAAGATATTTG 
ATTATCTTAAAAATTGTTAAATACCGTTTTCATGAAATTTCTCAGTATTGTAACAGCAACTT 
GTCAAACCTAAGCATATTTGAATATGATCTCCCATAATTTGAAATTGAAATCGTATTGTGTG 
GCTCTGTATATTCTGTTAAAAAATTAAAGGACAGAAACCTTTCTTTGTGTATGCATGTTTGA 
ATTAAAAGAAAGTAATGGAAG 
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FIGURE 15 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA39979 
xsubunit 1 of 1, 204 aa, I stop 
><MW: 22147, pi: 3.37, NX(S/T): 3 

MVCGGFACSKNCLCALNLLYTLVSLLLIGIAAWGIGFGLISSLRWGWIAVGIFLFLIALV 
GLIGAVKHHQVLLFFYMI ILLLVFIVQFSVSCACLALNQEQQGQLLEVGWNNTASARNDIQR 
NLNC C G FRS VNPNDT CLAS C VKS DHS CS PCAP 1 1 GEYAGEVLRFVGG I GLFFS FTE I LGVWL 
TYRYRNQKDPRANPSAFL 

Signal Peptide: 

amino acids 1-34 

Transmembrane domains: 

amino acids 47-63, 72-95 and 162-182 
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FIGURE 16 

TGATTGGAGCTGTAAAAJVANTCTTCAGGTGTTGTNATTTTTTTATATGATTATTCTGTAANT 
TGTATTTATTGTTCAGTTTTNTGTATCTTGCGCTTGTTTAGCCNTGAACCAGGAGCAACAGG 
GTCAGNTTNTGGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATCCAGAGAAAT 
NTAAACTGCTGTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTNTGGCTAGCTGTGTTAA 
AAGTGAC C ACTNGTG CT C G C CATGTGCT C CAATCA7AGGAGAATATGCTGGAGAGG TTTTGA 

GATTTGTTGGTGGCATTGGCCTGTTNTTCAGTTTTACAGAGATCCTGGGTGTTTGGCTGACC 
TACAGAT AC AGGAAC CAG 
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FIGURE 17 

AATCCCAAATTCCCCAATTTTTTTGGNCTTTTTAGGGi^GATGTGTTGTGGTAAAAAGTGT 
TAG T AT AAAAA T G AT AAT T T AC T T GT AGT C T T TT AT GAT TAC AC CAAT GT AT T C TAGAATAG 
T TAT G T C T TAG GAAAT T G T GGT T TAATTT T T GAC TTT TACAGGTAAGTGCAAAGGAGAAGT G 
GTTTCATGAAATGTTCTAATGTATAATAACATTTACCTTCAGCCTCCCATCAGAATGGAACG 
AGTTTTGAGTAATCCAGGAAGTATATCTATATGATCTTGATATTGTTTTATATAATTTGAAG 
TCTAAAAGACTGCATTTTTAAACAAGTTAGTATTAATGCGTTGGCCCACGTAGCAAAAAGAT 
ATTTGATTATCTTAAAAATTGTTAAATACCGTTTTCATGAAAGTTCTCAGTATTGTAACAGC 
AACTTGTCAAACCTAAGCATATTTGAATATGATCTCCCATAATTTGAAATTGAAATCGTATT 

GTGTGGAGGAAATGGCAATCTTATGTGTGCTGAAGGACACAGTAAGAGCACCAAGTTGTGCC 
CCACTTGC 
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FIGURE 18 



ATGATTATTCTGTTACTTGTATTTATTGTTCAGTTTTATGGTATCTTGCGCTTGTTTAGCCC 
CTGAAACCAGGAGCAACAGGGNNCAGCTTCCTGGAGGTTGGTTGGCAACAATCACGGCCAAG 
TGACTCCGCAAATGACATCCCAGAGAAATCCTAAACTGCTGTGGGTTCGGAAGTGTTAACCC 
AAATGACACCTGTCTGGCTNGCTGTGTTAAAAGTGACCACTCGTGCTCGCCATGTGCTCCAA 
TCATAGGAGAATATGC 
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FIGURE 19 

C AGT C AC C A2S AAGCT GGG CT GTGTC CT CATGGC CTGGGC C CT CT AC CTTTC C CTTGGT GT G 
CTCTGGGTGGCCCAGATGCTACTGGCTGCCAGTTTTGAGACGCTGCAGTGTGAGGGACCTGT 
CTGCACTGAGGAGAGCAGCTGCCAGACGGAGGATGACTTGACTGATGCAAGGGAAGCTGGCT 
TCCAGGTCAAGGCCTACACTTTCAGTGAACCCTTCCACCTGATTGTGTCCTATGACTGGCTG 
ATCCTCCAAGGTCCAGCCAAGCCAGTTTTTGAAGGGGACCTGCTGGTTCTGCGCTGCCAGGC 
CTGGCAAGACTGGCCACTGACTCAGGTGACCTTCTACCGAGATGGCTCAGCTCTGGGTCCCC 
C C GGG C CTAAC AGGG AAT T C TC CATCAC CGTGGTACAAAAGGCAGACAGCGGGCACTAC CAC 
TGCAGTGGCATCTTCCAGAGCCCTGGTCCTGGGATCCCAGAAACAGCATCTGTTGTGGCTAT 
CACAGTCCAAGAACTGTTTCCAGCGCCAATTCTCAGAGCTGTACCCTCAGCTGAACCCCAAG 
CAGGAAGCC C CATGAC C CTGAGTTGTCAGACAAAGTTGCCCCTGCAGAGGTCAGCTGCCCGC 
CT CCT CTT CT CCTTCT AC AAGGATGGAAGGATAGTGCAAAGC AGGGGGCTCTCCTCAGAATT 
C C AGAT CC C CAC AG CTT CAGAAGAT CACT C CGGGTCATACTGGTGTGAGGCAGC CACTGAGG 
ACAAC C AAGTTTGGAAACAG AGC CCC GAGCTAGAGAT CAGAGTGCAGGGTGCTT C CAGCTCT 
GCTGCACCTCCCACATTGAATCCAGCTCCTCAGAAATCAGCTGCTCCAGGAACTGCTCCTGA 
GGAGGCCCCTGGGCCTCTGCCTCCGCCGCCAACCCCATCTTCTGAGGATCCAGGCTTTTCTT 
CTCCTCTGGGGATGCCAGATCCTCATCTGTATCACCAGATGGGCCTTCTTCTCAAACACATG 
CAGGATGTGAGAGTCCTCCTCGGTCACCTGCTCATGGAGTTGAGGGAATTATCTGGCCACCA 
GAAG C CTGGGAC CAC AAAGG CTACTGCTGAAXSfiAAGTAAACAGTTCAT C CATGAT CTCACT 
TAAC CACC C CAATAAAT CTG ATTCTTTATTTT CTCTTC CTGTCCTGCACATATG CATAAGTA 
CTTTTAC AAGTTGT C C C AGT GTTTTGTTAGAATAATGT AGTTAGGTGAGTGT AAATAAATTT 
ATATAAAGTGAGAATTAGAGTTTAGCTATAATTGTGTATTCTCTCTTAACACAACAGAATTC 
TG CTGT CTAG AT CAGG AATTT CTATCTGTTATATCGAC CAGAATGTTGTGATTTAAAGAGAA 
CTAATGGAAGTGGATTGAATACAGCAGTCTCAACTGGGGGCAATTTTGCCCCCCAGAGGACA 
TTGGGCAATGTTTGGAGACATTTTGGTCATTATACTTGGGGGGTTGGGGGATGGTGGGATGT 
GTGTCTACTGGCATCCAGTAAATAGAAGCCAGGGGTGCCGCTAAACATCCTATAATGCACAG 
GG CAGT A C C C C AC AACGAAAAATAAT CTGGC CCAAAATGTCAGTTGTACTGAGTTTGAGAAA 
CCC CAG C CT AATG AAA C C CT AGGTGTTGGGCTCTGGAATGGGACTTTGT C C CTT CTAATTAT 
TAT CT CTTT C CAG C CT C ATT CAGCTATTCTTACTGACATACCAGT CTTTAGCTGGTG CTATG 
GT CT GTT CTTTAGTT CTAGTTTGTATCC C CTCAAAAGC CATTATGTTGAAATCCTAAT C CC C 
AAGGTGATGGCATTAAGAAGTGGGCCTTTGGGAAGTGATTAGATCAGGAGTGCAGAGCCCTC 
ATGATTAGG ATT AGTG C C CTTATTTAAAAAGGCCC CAGAGAGCT AACT CACCCTT CCAC CAT 
ATGAGGACGTGGC AAGAAGATGACATGT ATGAGAAC CAAAAAACAGCTGT CGCCAAACACCG 
ACTCTGTCGTTGCCTTGATCTTGAACTTCCAGCCTCCAGAACTATGAGAAATAAAATTCTGG 
TTGTTTGTAGCCTAA 
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FI GURE 2 0 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA40594 
xsubunit 1 of 1, 359 aa, 1 stop 
><MW: 38399, pi: 5.21, NX(S/T): 0 

MKLGCVLMAWALYLSLGVLWVAQMLLAAS FETLQCEGFVCTEESSCHTEDDLTDAREAGFQV 
KAYTFSEPFHLIVSYDWLILQGPAKPVFEGDLLVLRCQAWQDWPLTQVTFYRDGSALGPPGP 
NREFSITWQKADSGHYHCSGIFQSPGPGIPETASWAITVQELFPAPILRAVPSAEPQAGS 
PMTLSCQTKLPLQRSAARLLFSFYKDGRIVQSRGLSSEFQIPTASEDHSGSYWCEAATEDNQ 
VWKQSPQLEIRVQGASSSAAPPTLNPAPQKSAAPGTAPEEAPGPLPPPPTPSSEDPGFSSPL 
GMPDPHLYHQMGLLLKHMQDVRVLLGHLLMELRELSGHQKPGTTKATAE 

Leucine zipper pattern sequence: 

amino acids 12-33 

Protein kinase C phosphorylation site: 
amino acids 353-355 
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CCCACGCGTCCGCCCACGCGTCCGCCCACGGGTCCGCCCACGCGTCCGGGCCACCAGAAGTT 
TGAGCCTCTTTGGTAGCAGGAGGCTGGAAGAAAGGACAGAAGTAGCTCTGGCTGTGaiBGGG 
ATCTTACTGGGCCTGCTACTCCTGGGGCACCTAACAGTGGACACTTATGGCCGTCCCATCCT 
GGAAGT G C C AG AGAGT GTAAC AGGAC CTTGGAAAGGGGATGTGAATCTT C C CTG C ACCTATG 
ACCCCCTGCAAGGCTACACCCAAGTCTTGGTGAAGTGGCTGGTACAACGTGGCTCAGACCCT 
GT CA C CAT CTTT CTA CGT GACT CTTCTGG AGAC CATAT C CAGCAGGCAAAGTAC CAGGGC CG 
C CTG C ATGT GAG C CA CAAGGTT C CAGGAGATGTATC C CT C CAATTGAGCAC CCTGGAGATGG 
ATGACCGGAGCCACTACACGTGTGAAGTCACCTGGCAGACTCCTGATGGCAACCAAGTCGTG 
AGAGATAAGATTACTGAGCTCCGTGTCCAGAAACTCTCTGTCTCCAAGCCCACAGTGACAAC 
TGGCAGCGGTTATGGCTTCACGGTGCCCCAGGGAATGAGGATTAGCCTTCAATGCCAGGCTC 
GGGGTTCTCCTCCCATCAGTTATATTTGGTATAAGCAACAGACTAATAACCAGGAACCCATC 
AAAGTAGCAACCCTAAGTACCTTACTCTTCAAGCCTGCGGTGATAGCCGACTCAGGCTCCTA 
TTT CTG CACT G C CAAGGG C CAGGTTGGCTCTGAGCAGCACAGCGACATTGTGAAGTTTGTGG 
TCAAAGACT C CT CAAAGCTACTCAAGACCAAGACTGAGGCACCTACAACCATGACATACCCC 
TTGAAAGCAACATCTACAGTGAAGCAGTCCTGGGACTGGACCACTGACATGGATGGCTACCT 
TGGAGAGACCAGTGCTGGGCCAGGAAAGAGCCTGCCTGTCTTTGCCATCATCCTCATCATCT 
CCTTGTGCTGTATGGTGGTTTTTACCATGGCCTATATCATGCTCTGTCGGAAGACATCCCAA 
CAAGAG C ATGT CTAC GAAG C AG CCAGGISAGAAAGT CTCT C CTCTT CCATTTTTGAC C C CGT 
CC CTG C C CT C AATTTTGATTACTGGCAGGAAATGTGGAGGAAGGGGGGTGTGGCAC AGACC C 
AATCCTAAGGCCGGAGGCCTTCAGGGTCAGGACATAGCTGCCTTCCCTCTCTCAGGCACCTT 
CTGAGGTTGTTTTGGCCCTCTGAACACAAAGGATAATTTAGATCCATCTGCCTTCTGCTTCC 
AGAATCCCTGGGTGGTAGGATCCTGATAATTAATTGGCAAGAATTGAGGCAGAAGGGTGGGA 
AACCAGGACCACAGCCCCAAGTCCCTTCTTATGGGTGGTGGGCTCTTGGGCCATAGGGCACA 
TGCCAGAGAGGCCAACGACTCTGGAGAAACCATGAGGGTGGCCATCTTCGCAAGTGGCTGCT 
CCAGTGATGAGCCAACTTCCCAGAATCTGGGCAACAACTACTCTGATGAGCCCTGCATAGGA 
CAGGAGTACCAGATCATCGCCCAGATCAATGGCAACTACGCCCGCCTGCTGGACACAGTTCC 
TCTGGATTATGAGTTTCTGGCCACTGAGGGCAAAAGTGTCTGTTAAAAATGCCCCATTAGGC 
CAGGATCTGCTGACATAATTGCCTAGTCAGTCCTTGCCTTCTGCATGGCCTTCTTCCCTGCT 
ACCT CTCTT CCTGGATAGCCCAAAGTGTCCGCCTACGAA.CACTGGAGCCGCTGGGAGTCACT 
GGCTTTGCCCTGGAATTTGCCAGATGCATCTCAAGTAAGCCAGCTGCTGGATTTGGCTCTGG 
GCCCTTCTAGTATCTCTGCCGGGGGCTTCTGGTACTCCTCTCTAAATACCAGAGGGAAGATG 
CCCATAGCACTAGGACTTGGTCATCATGCCTACAGACACTATTCAACTTTGGCATCTTGCCA 
C CAGAAG AC C C G AGGG AGG CT CAGCTCTGC CAG CT CAGAGGAC CAGCTATATC CAGGAT CAT 
TTCT CTTT CTT CAGGGC CAGACAGCTTTTAATTGAAATTGTTATTTCACAGGCCAGGGTTCA 

GTTCTGCTCCTCCACTATAAGTCTAATGTTCTGACTCTCTCCTGGTGCTCAATAAATATCTA 
AT CATAACAGC 
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></usr/seqdb2/sst/-DNA/Dnaseqs .min/ss . DNA45416 
xsubunit 1 of 1, 321 aa, 1 stop 
XMW: 35544, pi: 8.51, NX(S/T): 0 

MGILLGLLLLGHLTVDfYGRPILEVPESVTGPWKGDVNLPCTYDPLQGYTQVLVKWLVQRGS 
DPVTIFLRDSSGDHIQQAKYQGRLHVSHKVPGDVSLQLSTLJ5MDDRSHYTCEVTWQTPDGNQ 
VVRDKITELRVQKLSVSKPTVTTGSGYGFTVPQGMRISLQCQARGSPPISYIWYKQQTNNQE 
PIKVATLSTLLFKPAVIADSGSYFCTAKGQVGSEQHSDIVKFVVKDSSKLLKTKTEAPTTMT 

YPLKATSTVKQSWDWTTDMDGYLGETSAGPGKSLPVFAIILIISLCCMWFTMAYIMLCRKT 
SQQEHVYEAAR 

Glycosaminoglycan attachment site: 

amino acids 149-152 

Transmembrane domain: 

amino acids 27 6-306 
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FI GURE 2 3 

GCGCCGGGAGCCCATCTGCCCCCAGGGGCACGGGGCGCGGGGCCGGCTCCCGCCCGGCACAT 
GGCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGTCGGA 
GGCGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCGGGGATC 
GGGAI2TCCCTCCTCCTTCTCCTCTTGCTAGTTTCCTACTATGTTGGAACCTTGGGGACTCA 
CACTGAGATCAAGAGAGTGGCAGAGGAAAAGGTCACTTTGCCCTGCCACCATCAACTGGGGC 
TTCCAGAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGAAGGGAACCAAAAA 
GTGGT G AT C AC TTACT C CAGTCGT CATGT CTACAATAACTTGACTGAGGAACAGAAGGGCCG 
AGTGG C CTTT G CTT C C AATTTCCTGGC AGGAGATGCCTC CTTGC AGATTGAAC CT CTGAAGC 
CCAGTGATGAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGGCGCTACGTGTGGAGCCAT 
GT CAT CT T AAAAGT CTT AGTGAGAC CAT C CAAGC C CAAGTGTGAGTTGGAAGGAGAGCTGAC 
AGAAGGAAGTGACCTGACTTTGCAGTGTGAGTCATCCTCTGGCACAGAGCCCATTGTGTATT 
ACTGGCAGCGAATCCGAGAGAAAGAGGGAGAGGATGAACGTCTGCCTCCCAAATCTAGGATT 
GACTACAACCACCCTGGACGAGTTCTGCTGCAGAATCTTACCATGTCCTACTCTGGACTGTA 
CCAGTGCACAGCAGGCAACGAAGCTGGGAAGGAAAGCTGTGTGGTGCGAGTAACTGTACAGT 
ATGTACAAAGCATCGGCATGGTTGCAGGAGCAGTGACAGGCATAGTGGCTGGAGCCCTGCTG 
ATTTTC CT CTTGGTGTGGCTGCTAAT C CGAAGGAAAGACAAAGAAAGATATGAGGAAG AAG A 
GAGACCTAATGAAATTCGAGAAGATGCTGAAGCTCCAAAAGCCCGTCTTGTGAAACCCAGCT 
CCTCTTCCTCAGGCTCTCGGAGCTCACGCTCTGGTTCTTCCTCCACTCGCTCCACAGCAAAT 
AGTGCCTCACGCAGCCAGCGGACACTGTCAACTGACGCAGCACCCCAGCCAGGGCTGGCCAC 
C CAGG CAT A C AG C CTAGTGGGG C CAGAGGTGAGAGGTTCTGAAC CAAAGAAAGTC C ACCATG 
CTAATCTGAC CAAAGCAGAAACCACACCCAGCATGATCCCCAGC CAGAGCAGAGCCTTCCAA 
ACGGTCIS&ATTACAATGGACTTGACTCCCACGCTTTCCTAGGAGTCAGGGTCTTTGGACTC 
TT CT CGT.C ATT GGAG CT C AAGT C AC C AGC CAC ACAAC CAGATGAGAGGT CAT CTAAGTAGC A 
GTGAGCATTGCACGGAACAGATTCAGATGAGCATTTTCCTTATACAATACCAAACAAGCAAA 
AGGAT GT AAG CTG ATT CATCTGTAAAAAGGCATCTTATTGTGCCTTTAGAC CAGAGTAAGGG 
AAAGCAGGAGTCCAAATCTATTTGTTGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTG 
AGGTGAATATACCTAAAACTTTTAATGTGGGATATTTTGTATCAGTGCTTTGATTCACAATT 
TTCAAGAGGAAATGGGATGCTGTTTGTAAATTTTCTATGCATTTCTGCAAACTTATTGGATT 
ATTAGTTATTCAGACAGTCAAGCAGAACCCACAGCCTTATTACACCTGTCTACACCATGTAC 
TGAG CTAAC C ACTTCTAAGAAACT C CAAAAAAGGAAACATGTGTCTT CTATTCTGACTTAAC 
TT CATTTGT CAT AAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATAGTGGGAGATGGAGA 
AGAGTG AATGAGTTT CT C C C ACT CTATACTAAT CTCACTATTTGTATTGAG C C CAAAATAAC 
TATGAAAGGAGACAAAAATTTGTGACAAAGGATTGTGAAGAGCTTTCCATCTTCATGATGTT 
ATGAGGATTGTTGACAAACATTAGAAATATATAATGGAGCAATTGTGGATTTCCCCTCAAAT 
CAGATG C CT CTAAGGACTTTCCTGCTAGATATTTCTGGAAGGAGAAAATACAACATGTCATT 
TAT CAACG T C CTT AG AAAG AATT CTT CT AG AG AAAAAGGGAT CTAGG AATG CTGAAAGATTA 
CCCAACATACCATTATAGTCTCTTCTTTCTGAGAAAATGTGAAACCAGAATTGCAAGACTGG 
GT GGACT AG AAAGGGAG ATT AGAT CAGTTTT CT CTT AATATGT CAAGGAAGGT AG CCGGGCA 
TGGTGCCAGGCACCTGTAGGAAAATCCAGCAGGTGGAGGTTGCAGTGAGCCGAGATTATGCC 
ATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 
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FIGURE 24 

></usr/ seqdb2/sst/DNA/Dnaseqs .m±n/ ss .DNA45419 
xsubunit 1 of i, 373 aa, 1 stop 
><MW: 41231, pi: 3,33, NX(S/T): 3 

MSLLLLLLLVSYYVGTLGTKTEIKRVAEEKVTLPCHKQLGLPEKDTLDIEWLLTDNEGNQKV 
VITYSSRKVYOTTLTEEQKGRVAF^^ 

ILKVLVRPSKPKCELEGELTEGSDLTLQCSSSSGTEPIVYYWQRIREKEGEDERLPPKSRID 
YNHPGRVLLQNLTMSYSGLYQCTAGNEAGKESCVVRVTVQYVQSIG^1VAGAVTGIVAGALLI 
FLLVWLLIRRKDKERYEEEERPNEIREDAEAPKARLVKPSSSSSGSRSSRSGSSSTRSTANS 
ASRSQRTLSTDAAPQPGLATQAYSLVGPETOGSEPKKVHHANLTKAETTPSMIPSQSRAFQW 

Transmembrane domain: 

amino acids 221-254 
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FIGURE 25 

GTCGTTCCTTTGCTCTCTCGCGCCCAGTCCTCCTCCCTGGTTCTCCTCAGCCGCTG^CGGAG 

GAGAGCACCCGGAGACGCGGGCTGCAGTCGCGGCGGCTTCTCCCCGCCTGGGCGGCCTC^CC 

GCTGGGCAGGTGCTGAGCGCCCCTAGAGCCTCCCTTGCCGCCTCCCTCCTCTGCCCGGCCGC 

AGCAGTGCACATGGGGTGTTGGAGGTAGATGGGCTCCCGGCCCGGGAGGCGGCGGTGGATGC 

GGCGCTGGGCAGAAGCAGCCGCCGATTCCAGCTGCCCCGCGCGCCCCGGGCGCCCCTGCGAG 

TCCCCGGTTCAGCCAXSGGGACCTCTCCGAGCAGCAGCACCGCCCTCGCCTCCTGCAGCCGC 

ATCGCCCGCCGAGCCACAGCCACGATGATCGCGGGCTCCCTTCTCCTGCTTGGATTCCTTAG 
C AC C A C C AC AG CT C AG C C AGAACAGAAGG C CT CGAAT CT CATTGGCACATACCGC CATGTTG 

ACCGTGCCACCGGCCAGGTGCTAACCTGTGACAAGTGTCCAGCAGGAACCTATGTCTCTGAG 
CATTGTACCAACACAAGCCTGCGCGTCTGCAGCAGTTGCCCTGTGGGGACCTTTACCAGGCA 
TGAGAATGG CATAGAGAAATGCCATGACTGTAGTCAGCCATGC CCATGGCCAATGATTGAGA 
AATTACCTTGTGCTGCCTTGACTGACCGAGAATGCACTTGCCCACCTGGCATGTTCCAGTCT 
AACGCTAC CT GTG C CC C C CATACGGTGTGTC CTGTGGGTTGGGGTGTGCGGAAGAAAGGGAC 

AGAGACTGAGGATGTGCGGTGTAAGCAGTGTGCTCGGGGTACCTTCTCAGATGTGCCTTCTA 
GTGTGATGAAATGCAAAGCATACACAGACTGTCTGAGTCAGAACCTGGTGGTGATCAAGCCG 
GGGACCAAGGAGACAGACAACGTCTGTGGCACACTCCCGTCCTTCTCCAGCTCCACCTCACC 
TTCCCCTGGCACAGCCATCTTTCCACGCCCTGAGCACATGGAAACCCATGAAGTCCCTTCCT 
CCACTTATGTTCCCAAAGGCATGAACTCAACAGAATCCAACTCTTCTGCCTCTGTTAGACCA 
AAGGT AC T GAG TAG CAT C C AGGAAGGG ACAGTCC CTGACAACACAAGCTCAGCAAGGGGGAA 
GGAAGACG T G AACAAGAC C CTC C CAAAC CTT CAGGTAGTCAACCAC CAGCAAGG CC C C CACC 

ACAGACACATCCTGAAGCTGCTGCCGTCCATGGAGGCCACTGGGGGCGAGAAGTCCAGCACG 
C C CAT CAAGGG C C C C AAGAGGGGAC AT C CTAGACAGAACCTACAC AAGC ATTTTG ACATCAA 

TGAGCATTTGCCCTGGATGATTGTGCTTTTCCTGCTGCTGGTGCTTGTGGTGATTGTGGTGT 
GC AGT AT C C GG AAAAG CT CG AGGACTCTGAAAAAGGGGC C CCGGCAGGATC CCAGTGC CATT 
GTGGAAAAGG C AGGGCTGAAGAAAT C CATG ACT CCAAC C CAGAAC CGGGAGAAATGGATCTA 
CTACTGCAATGGCCATGGTATCGATATCCTGAAGCTTGTAGCAGCCCAAGTGGGAAGCCAGT 
GGAAAGATATCTATCAGTTTCTTTGCAATGCCAGTGAGAGGGAGGTTGCTGCTTTCTCCAAT 
GGGTACACAGCCGACCACGAGCGGGCCTACGCAGCTCTGCAGCACTGGACCATCCGGGGCCC 
CGAGGCCAGCCTCGCCCAGCTAATTAGCGCCCTGCGCCAGCACCGGAGAAACGATGTTGTGG 
AGAAGATTCGTGGGCTGATGGAAGACACCACCCAGCTGGAAACTGACAAACTAGCTCTCCCG 
ATGAGCCCCAGCCCGCTTAGCCCGAGCCCCATCCCCAGCCCCAACGCGAAACTTGAGAATTC 
CGCTCTCCTGACGGTGGAGCCTTCCCCACAGGACAAGAACAAGGGCTTCTTCGTGGATGAGT 
CGGAGCCCCTTCTCCGCTGTGACTCTACATCCAGCGGCTCCTCCGCGCTGAGCAGGAACGGT 
TCCTTTATTACCAAAGAAAAGAAGGACACAGTGTTGCGGCAGGTACGCCTGGACCCCTGTGA 
CTTGCAGCCTATCTTTGATGACATGCTCCACTTTCTAAATCCTGAGGAGCTGCGGGTGATTG 
AAGAG ATT C C C C AGG CTG AGGACAAACT AG AC CGGCTATTCGAAATTATTGGAGT C AAGAGC 

CAGGAAGCCAGCCAGACCCTCCTGGACTCTGTTTATAGCCATCTTCCTGACCTGCTGTaSAA 
CATAGGGATACTGCATTCTGGAAATTACTCAATTTAGTGGCAGGGTGGTTTTTTAATTTTCT 
TCTGTTTCTGATTTTTGTTGTTTGGGGTGTGTGTGTGTGTTTGTGTGTGTGTGTGTGTGTGT 
GTGTGTGTGTGTGTTTAACAGAGAATATGGCCAGTGCTTGAGTTCTTTCTCCTTCTCTCTCT 
CTCTTTTTTTTTTAAATAACTCTTCTGGGAAGTTGGTTTATAAGCCTTTGCCAGGTGTAACT 
GTTGTGAAATACCCACCACTAAAGTTTTTTAAGTTCCATATTTTCTCCATTTTGCCTTCTTA 
TGTATTTTCAAGATTATTCTGTGCACTTTAAATTTACTTAACTTACCATAAATGCAGTGTGA 
CTTTTCCCACACACTGGATTGTGAGGCTCTTAACTTCTTAAAAGTATAATGGCATCTTGTGA 
ATCCTATAAGCAGTCTTTATGTCTCTTAACATTCACACCTACTTTTTAAAAACAAATATTAT 
TACTATTTTTATTATTGTTTGTCCTTTATAAATTTTCTTAAAGATTAAGAAAATTTAAGACC 
C C ATTGAGTTACTGTAATG CAATT CAACTTTGAGTTAT CTTTTAAATATGT CTTGTATAGTT 

CATATTCATGGCTGAAACTTGACCACACTATTGCTGATTGTATGGTTTTCACCTGGACACCG 
TGTAGAATGCTTGATTACTTGTACTCTTCTTATGCTAATATGCTCTGGGCTGGAGAAATGAA 
ATCCTCAAGCCATCAGGATTTGCTATTTAAGTGGCTTGACAACTGGGCCACCAAAGAACTTG 
AACTTCACCTTTTAGGATTTGAGCTGTTCTGGAACACATTGCTGCACTTTGGAAAGTCAAAA 
TCAAGTGCCAGTGGCGCCCTTTCCATAGAGAATTTGCCCAGCTTTGCTTTAAAAGATGTCTT 
GTTTTTTATATACACATAAT CAATAGGTCCAAT CTGCTCTCAAGGCCTTGGTC CTGGTGGGA 

TTCCTTCACCAATTACTTTAATTAAAAATGGCTGCAACTGTAAGAACCCTTGTCTGATATAT 
TTGCAACTATGCTCCCATTTACAAATGTACCTTCTAATGCTCAGTTGCCAGGTTCCAATGCA 
AAGGTGGCGTGGACTCCCTTTGTGTGGGTGGGGTTTGTGGGTAGTGGTGAAGGACCGATATC 
AGAAAAATGCCTTCAAGTGTACTAATTTATTAATAAACATTAGGTGTTTGTTAAAAAAAAAA 
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FIGURE 26 



x/usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA52594 
xsufaunit 1 of 1, 655 aa, I stop 
xMW: 71845, pi: 8.22, NX(S/T): 8 

MGTSPSSSTALASCSRIARRATATMIAGSLLLLGFLSTTTAQPHQKASNLIGTYRHVDRATG 
QVLTCDKCPAGTYVSEHCTNTSLRVCSSCPVGTFTRHENGIEKCHDCSQPCPWPMIEKLPCA 
ALTDRE CT C P PGMFQSNAT CAPHTVCP VGWGVRKKGTETEDVRCXQCARGTFSDVP SS VMKC 
KAYTDCLSQNLWIKPGTKETDNVCGTLPSFSSSTSPSPGTAIFPRPEHMETHEVPSSTYVP 
KGMNSTESNSSASVRPKVLSSIQEGTVPDNTSSARGKEDVNKTLPNLQVVNHQQGPHHRHIL 
KLLPSMEATGGEKSSTPIKGPKRGHPRQNLHKHFDINEHLPWMIVLFLLLVLVVIVVCSIRK 
SSRTLKKGPRQDPSAIVEKAGLKKSMTPTQNREKWIYYCNGHGIDILKLVAAQVGSQWKDIY 
QFLCNASEREVAAFSNGYTADHERAYAALQHWTIRGPEASIAQLISALRQHRRNDVVEKIRG 
LMEDTTQLETDKLAL PMS PS PLS PS P I PSPNAKLENSALLTVEPS PQDKNKGFFVDESEPLL 

RCDSTSSGSSALSRNGSFITKEKKDTVLRQVRLDPCDLQPIFDDMLHFLNPEELRVIEEIPQ 
AEDKLDRLFE 1 1 G VKSQE ASQTLLDS VYSHLPDLL 
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FIGURE 27 

ATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGA 
CTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCAC 
CJMSGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCGCTTTTTGGCC 
TTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAGATGCTGTTGCTGCACAGATCCTG 
T C ACTG CT G C C ATTG AAGTTTTTT C CAAT CAT C GT CATTGGGAT CATTGCATTGATATTAGC 

ACTGGCCATTGGTCTGGGCATCCACTTCGACTGCTCAGGGAAGTACAGATGTCGCTCATCCT 
TTAAGTGTATCGAGCTGATAGCTCGATGTGACGGAGTCTCGGATTGCAAAGACGGGGAGGAC 
GAGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTGCTTC 
GTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCCCAAC 
TGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGGCAGTTC 
C GGGAGGAGTT T GT GT C CAT CGAT CAC CTCTTG C CAGATG ACAAGGTGACTGCATTACAC CA 

CTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAGTGCACAGCCT 
GTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCCTTGCTCTCGCAG 
TGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGGGGCTCTGTCATCAC 
GCCCCTGTGGATCATCACTGCTGCACACTGTGTTTATGACTTGTACCTCCCCAAGTCATGGA 
CCATCCAGGTGGGTCTAGTTTCCCTGTTGGACAATCCAGCCCCATCCCACTTGGTGGAGAAG 
ATTGTCTACCACAGCAAGTACAAGCCAAAGAGGCTGGGCAATGACATCGCCCTTATGAAGCT 
GGCCGGGCCACTCACGTTCAATGAAATGATCCAGCCTGTGTGCCTGCCCAACTCTGAAGAGA 
ACTTCCCCGATGGAAAAGTGTGCTGGACGTCAGGATGGGGGGCCACAGAGGATGGAGGTGAC 
GCCTCCCCTGTCCTGAACCACGCGGCCGTCCCTTTGATTTCCAACAAGATCTGCAACCACAG 
GGACGTGTACGGTGGCATCATCTCCCCCTCCATGCTCTGCGCGGGCTACCTGACGGGTGGCG 
TGGACAGCTGCCAGGGGGACAGCGGGGGGCCCCTGGTGTGTCAAGAGAGGAGGCTGTGGAAG 
TTAGTGGGAGCGACCAGCTTTGGCATCGGCTGCGCAGAGGTGAACAAGCCTGGGGTGTACAC 
CCGTGTCACCTCCTTCCTGGACTGGATCCACGAGCAGATGGAGAGAGACCTAAAAACCSS&A 
GAGGAAGGGGACAAGTAGCCACCTGAGTTCCTGAGGTGATGAAGACAGCCCGATCCTCCCCT 
GGACTCCCGTGTAGGAACCTGCACACGAGCAGACACCCTTGGAGCTCTGAGTTCCGGCACCA 
GTAG C AGGC C C G AAAG AGG CAC C CTTC CAT CTGATT CCAGCACAAC CTT CAAGCTGCTTTTT 

GTTTTTTGTTTTTTTGAGGTGGAGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGAAA 
TCCCTGCTCACTGCAGCCTCCGCTTCCCTGGTTCAAGCGATTCTCTTGCCTCAGCTTCCCCA 
GTAGCTGGGACCACAGGTGCCCGCCACCACACCCAACTAATTTTTGTATTTTTAGTAGAGAC 
AGGGTTTCACCATGTTGGCCAGGCTGCTCTCAAACCCCTGACCTCAAATGATGTGCCTGCTT 
CAGCCTCCCACAGTGCTGGGATTACAGGCATGGGCCACCACGCCTAGCCTCACGCTCCTTTC 
TGATCTTCACTAAGAACAAAAGAAGCAGCAACTTGCAAGGGCGGCCTTTCCCACTGGTCCAT 
CTGGTTTTCTCTCCAGGGTCTTGCAAAATTCCTGACGAGATAAGCAGTTATGTGACCTCACG 
TGCAAAGCCACCAACAGCCACTCAGAAAAGACGCACCAGCCCAGAAGTGCAGAACTGCAGTC 
ACTG CACGTTTT CAT CTCTAGGGACCAGAACCAAACC CAC CCTTTCTACTTCCAAGACTTAT 
TTTCACATGTGGGGAGGTTAATCTAGGAATGACTCGTTTAAGGCCTATTTTCATGATTTCTT 
TGTAGCATTTGGTGCTTGACGTATTATTGTCCTTTGATTCCAAATAATATGTTTCCTTCCCT 
CATTGTCTGGCGTGTCTGCGTGGACTGGTGACGTGAATCAAAATCATCCACTGAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA45234 
xsubunit 1 of 1, 453 aa, 1 stop 
><MW: 49334, pi: 6.32, NX(S/T): 1 



MGENDPPAVEAPPSFRSLFGLDDLKISPVAPDADAVAAQILSLLPLKFFPIIVIGIIALILA 

LAIGLGIHFDCSGKYRCRSSFKCIELIARCDGVSDCKDGEDEYRCVRVGGQNAVLQVFTAAS 
WXTMCSDDWKGHYAWACAQLGFPSYVSSD^ 

S VYVRE GC AS GHWTLQ CT ACGHRRG YS SR I VGGNMS LLS Q WPWQASLQ FQG YHLCGGS VI T 
PLWIITAAHCVYDLYLPKSWTIQVGLVSLL^ 

AGPLTFNEM I Q P VCL PNS EENFPDGKVCWTSGWGATEDGGDAS PVLNHAAVPL I SNKI CNHR 

DVYGGIISPSMLCAGYLTGGVDSCQGDSGGPLVCQERRLWKLVGATSFGIGCAEVNKPGVYT 
RVTS FLDW I HEQMERDLKT 
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FIGUR E 2 9 



CCCACGCGTCCGTCCTAGTCCCCGGGCCAACTCGGACAGTTTGCTCATTTATTGCAACGGTC 
AAGGCTGGCTTGTGCCAGAACGGCGCGCGCGCGCGCACGCACGCACACACACGGGGGGAAAC 
TTTTTTAAAAATGAAAGGCTAGAAGAGCTCAGCGGCGGCGCGGGCGCTGCGCGAGGGCTCCG 
GAGCTGACTCGCCGAGGCAGGAAATCCCTCCGGTCGCGACGCCCGGCCCCGGCTCGGCGCCC 
GCGTGGGATGGTGCAGCGCTCGCCGCCGGGCCCGAGAGCTGCTGCACTGAAGGCCGGCGACG 
aiS GCAGCGCGCCCGCTGCCCGTGTCCCCCGCCCGCGCCCTCCTGCTCGCCCTGGCCGGTGC 
T C TG CT CGCGCCCTG CGAGG C C CG AGGGGTGAGCTTATGG AAC CAAGGAAGAGCTGATGAAG 

TTGTCAGTGCCTCTGTTCGGAGTGGGGACCTCTGGATCCCAGTGAAGAGCTTCGACTCCAAG 
AAT CAT C C AGAAGTG CT GAATATTCGACTACAACGGGAAAG CAAAGAACTGATCATAAATCT 
GGAAAGAAATGAAGGT CT CATTGCCAGCAGTTTCACGGAAACC CACTATCTGCAAGACGGTA 
CTGATGTCTCCCTCGCTCGAAATTACACGGGTCACTGTTACTACCATGGACATGTACGGGGA 
TATTCTGATTCAGCAGTCAGTCTCAGCACGTGTTCTGGTCTCAGGGGACTTATTGTGTTTGA 
AAATG AAAG CT ATGT CTT AGAAC CAATG AAAAGTGCAAC CAACAGATACAAACT CTT CCCAG 
C GAAGAAG CT G AAAAG C GT C CGGGGAT CATGTGGAT CACAT C ACAACACACCAAAC CTCGCT 
GCAAAG AATGTGTTT CCACC ACCCTCTCAGACATGGGCAAGAAGGCATAAAAGAGAGACC CT 
CAAGGCAACTAAGTATGTGGAGCTGGTGATCGTGGCAGACAACCGAGAGTTTCAGAGGCAAG 
GAAAAG AT CTGGAAAAAG TT AAGCAGCGATTAAT AGAG ATTGCTAAT CACGTTGACAAGTTT 

TACAGACCACTGAACATTCGGATCGTGTTGGTAGGCGTGGAAGTGTGGAATGACATGGACAA 
ATGCTCTGTAAGTCAGGACCCATTCACCAGCCTCCATGAATTTCTGGACTGGAGGAAGATGA 
AGCTT CTA C CTCGCAAAT C C CATGACAATGCGCAGCTTGT CAGTGGGGTTTATTTC CAAGGG 
AC C AC CAT CGGCATGGC C C C AATCATGAGCATGTGCACGGCAGACCAGTCTGGGGGAATTGT 
CATGGACCATTCAGACAATCCCCTTGGTGCAGCCGTGACCCTGGCACATGAGCTGGGCCACA 
ATTTCGGGATGAATCATGACACACTGGACAGGGGCTGTAGCTGTCAAATGGCGGTTGAGAAA 
GGAGG CTG CAT CATGAACG CTT C C ACCGGGTAC CCATTTC C C ATGGTGTT CAGCAGTTGCAG 

CAGGAAGGACTTGGAGACCAGCCTGGAGAAAGGAATGGGGGTGTGCCTGTTTAACCTGCCGG 
AAGT C AGGGAGT CTTTCGGGGGCCAGAAGTGTGGGAACAGATTTGTGGAAGAAGGAGAGGAG 
TGTGACTGT GGGGAG C C AG AGGAATGTATGAATCG CTGCTG CAATGC CAC CACCTGTAC C CT 
GAAGCCGGACGCTGTGTGCGCACATGGGCTGTGCTGTGAAGACTGCCAGCTGAAGCCTGCAG 
GAACAGCGTGCAGGGACTCCAGCAACTCCTGTGACCTCCCAGAGTTCTGCACAGGGGCCAGC 
CCTCACTGCCCAGCCAATGTGTACCTGCACGATGGGCACTCATGTCAGGATGTGGACGGCTA 
CTGCTACAATGGCATCTGCCAGACTCACGAGCAGCAGTGTGTCACGCTCTGGGGACCAGGTG 
CTAAAC CTG C C C CTGGG AT CTG CTTTGAGAGAGTCAATTCTGCAGGTGATCCTTATGGCAAC 
TGTGGCAAAGT CT CGAAGAGTTCCTTTGCCAAATGCGAGATGAGAGATGCTAAATGTGGAAA 
AATCCAGTGTCAAGGAGGTGCCAGCCGGCCAGTCATTGGTACCAATGCCGTTTCCATAGAAA 
CAAACATCCCTCTGCAGCAAGGAGGCCGGATTCTGTGCCGGGGGACCCACGTGTACTTGGGC 
GATGACATGCCGGACCCAGGGCTTGTGCTTGCAGGCACAAAGTGTGCAGATGGAAAAATCTG 
CCTGAATCGTCAATGTCAAAATATTAGTGTCTTTGGGGTTCACGAGTGTGCAATGCAGTGCC 
ACGGCAGAGGGGTGTGCAACAACAGGAAGAACTGCCACTGCGAGGCCCACTGGGCACCTCCC 
TTCTGTGACAAGTTTGGCTTTGGAGGAAGCACAGACAGCGGCCCCATCCGGCAAGCAGAAGC 
AAGGCAGGAAGCTGCAGAGTCCAACAGGGAGCGCGGCCAGGGCCAGGAGCCCGTGGGATCGC 
AGGAGCATGCGTCTACTGCCTCACTGACACTCATCIS&GCCCTCCCATGACATGGAGACCGT 
GACCAGTGCTGCTGCAGAGGAGGTCACGCGTCCCCAAGGCCTCCTGTGACTGGCAGCATTGA 
CTCTGTGGCTTTGCCATCGTTTCCATGACAACAGACACAACACAGTTCTCGGGGCTCAGGAG 
GGGAAGTCCAGCCTACCAGGCACGTCTGCAGAAACAGTGCAAGGAAGGGCAGCGACTTCCTG 
GTTGAGCTTCTGCTAAAACATGGACATGCTTCAGTGCTGCTCCTGAGAGAGTAGCAGGTTAC 
C ACT CTGGC AGG C C C C AG C C CTGCAG CAAGGAGGAAGAGGACT CAAAAGTCTGGCCTTT CAC 

TGAGCCTCCACAGCAGTGGGGGAGAAGCAAGGGTTGGGCCCAGTGTCCCCTTTCCCCAGTGA 
CACCT CAGC CTTGGCAG CC CTGATGACTGGTCTCTGGCTGCAACTTAATGCTCTGATATGGC 
TTTTAGCATTTATTATATGAAAATAGCAGGGTTTTAGTTTTTAATTTATCAGAGACCCTGCC 
AC C CATT C CAT CT C CAT C C AAGC AAACTGAATGGCAATGAAACAAACTGGAGAAGAAGGTAG 
GAGAAAGGGCGGTGAACTCTGGCTCTTTGCTGTGGACATGCGTGACCAGCAGTACTCAGGTT 
TGAGGGTTTGCAG AAAG C C AGGGAACCCACAGAGT CAC CAAC C CTT CATTTAACAAGTAAGA 

ATGTTAAAAAGTGAAAACAATGTAAGAGCCTAACTCCATCCCCCGTGGCCATTACTGCATAA 
AATAGAGTGCATTTGAAAT 
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FIGURE 30 

></usr/seqdb2/sst/DNA/Dnaseqs -min/ss .DNA49624 
xsubunit 1 of 1, 735 aa, 1 stop 
XMW: 80177, pi: 7.08, NX(S/T): 5 
MAARPLPVSPARALLLALAGALIAPCEARGVS 

NHPEVLNIRLQRESKELIINLERNEGLIASSFTETHYLCDGTDVSIJ^RNYTGKCYYHGHVRG 
YS DS AVS LS T CS GLRGL I VFENES YVLEPMKSATNRYKLFPAKKLKSVRGSCGSHHNTPNLA 
AKNVFP P P 5 QT WARRHKRE T LKATKYVE LVI VADNRE FQRQGKDLEKVKQRL I E I ANHVDKF 

YRPLNIRIVLVGVEVWNDMDKCSVSQDPFTSLHEFLDWRKMKLLPRKSHDNAQLVSGVYFQG 
TTIGMAPIMSMCTADQSGGIVMDHSDNPLGAAV^ 

GGCIMNASTGYPFPMVFSSCSRKDLETSLEKGMGVCLFNLPEVRESFGGQKCGNRFVEEGEE 
CDCGEPEECMNRCCNATTCTLKPDAVCAHGLCCEDCQLKPAGTACRDSSNSCDLPE FCTGAS 
PHCPANVYLHDGH S CQDVDGYCYNG I C QTHEQQCVTLWGPGAKPAPG I C FERVNS AGDP YGN 
CGKVS KS S FAKCEMRDAKCGKI QCQGGASRPVI GTNAVS IETNI PLQQGGRI LCRGTHVYLG 
DDMPDPGLVLAGTKCADGKICLNRQCQNISVFGVHECAMQCHGRGVCNNRKNCHCEAHWAPP 
FCDKFGFGGSTDSGPIRQAEARQEAAESNRERGQGQEPVGSQEHASTASLTLI 
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FIG UR E 31 



TCCCAAGGCTTCTTGGATGGCAGATGATTNTGGGGTTTTGCATTGTTTCCCTGACAACGAAA 
AC AAAAC AG TTTTGGGGGTT C AG GAGGGGAANT C C AG C C T AC C CAGGAAGT T T GC AGAAAC A 
G T G C AAGGAAG G G C AG G ANT TCCTGGTT G AGNT T T T T GNT AAAAC AT GGACAT GNT TCAGT G 
CTGCTCNTGAGAGAGTAGCAGGTTACCACTTTTGGCAGGCCCCAGCCCTGCAGCAAGGAGGA 
AGAGGACTCAAAAGTTTGGCCTTTCACTGAGCCTCCACAGCAGTGGGGGAGAAGCAAGGGTT 
GGGCCCAGTGTCCCCTTTCCCCAGTGACACCTCAGCCTTGGCAGCCCTGATAACTGGTNTNT 
GG C T G CAANT T AAT G C TNT G ATAT G GC T T T TAG CAT T TAT TAT AT GAAAATAGCAGGG T T T T 
AGTTTTTAATTTATCAGAGACCCTGCCACCCATTCCATNTCCATCCAAG 
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FIGURE 32 



CAT C CT G C AACATGGT GAAAC CACG C CTGGCTAATTTT GTTGTATTTTTGGT AGAGATGGGA ' 

TTTCACCGTGTTAGCCAGGATTGTCTCAATCTGACCTCATGATCTGCCCGCCTCGGCCTCCC 

AAAGT G CTGGGATTACAGG C GAGTGCAAC CACAC C CGGC CACAAACTTTTTAAGAAGTTAAT 

GAAACCATACCTTTTACATTTTTAATGACAGGAAAATGCTCACAATAATTGTTAACCCAAAA 

TTCTGGATACAAAAGTACAATCTTTACTGTGTAAATACATGTATATGTACTATATGAAAATA 

TACCAAATATCAATAATACTTATCTCTGGGTAAAAACCTCTTCTCATACCCTGTGCTAAC^A 

CTTTTAACAAAAAATTTGCATCACTTTTAAGAATCAAGAAAAATT7CTGAAGGTCATATGGG 

ACAGAAAAAAAAACCAAGGGAAAAATCACGCCACTTGGGAAAAAAAGATTCGAAATCTGCCT 

TTTTATAGATTTGTAATTAATAAGGTCCAGGCTTTCTAAGCAACTTAAATGTTTTGTTTCGA 

AACAAAGTACTTGTCTGGATGTAGGAGGAAAGGGAGTGATGTCACTGCCATTATGATGCCCC 

TTGAATATAAGACCCTACTTGCTATCTCCCCTGCACCAGCCAGGAGCCACCCATCCTCCAGC 

AC ACT GAG C AG C AAGCTGGACACACGG CACACTGAT C CAA&ESGGTAAGGGGATGGTGGCGA 

TGCTCATTCTGGGTCTGCTACTTCTGGCGCTGCTCCTACCCGTGCAGGTTTCTTCATTTGTT 

CCTTTAACCAGTATGCCGGAAGCTACTGCAGCCGAAACCACAAAGCCCTCCAACAGTGCCCT 

ACAGCCTACAGCCGGTCTCCTTGTGGTCTTGCTTGCCCTTCTACATCTCTACCAT3ja&GAGG 

C AGGT C AAGAAAC AGCT AC AGTT CTC CAAC C C ATACACT AAAACCGAATC CAAATGGT GCCT 

AGAAGTT CAATGTGGCAAGGAAAAAAACCAGGT CTTCATCAAATCTACTAATTT CACT CCTT 

ATTAACAGAGAAACGCTTGAGAGTCTCAAACTGGACTGGTTTAAAGAGCATCTGAAGGATTT 

GACTAGATGATAAATGCCTGTACTCCCAGTACTTTGGGAGGCCTAGGCCGGCGGATCACCTG 

AGGTCAGGAGTTTGAGACTAACCTGGCCAAAATGGTGAAACCCCATCTGTACTAAAAATACA 

AATATTGACTGGGCGTGGTGGTGAGTGCCTGTGATCCCAGCTACTCAGGTGGCTGAAGCAGG 

ACAATCACTTGAACTCAGGAGGCAGAGGTTGCAGTGAGCTGAGATCGCGCTACTGCACTCTA 

GCCTAGCCTGGGCAACAGAGTGAGACTTCGTCTCAAAAAAAAAAAAGCCAAGTGCAGTGGCT 

CACGCCTGTAATCCCGGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCAGGAGATCA 

AGACCATCCTGGCTAATACAGTGAAACCCTGTCTCTACTAAAAATACAAAAAATTAGCCGGG 

GATGGTGGCAGGCACCTGGAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATAGCGTGAA 

CTCAGGAGGCGGAGCTTGCAGTGAGCCGAGATTGCGCTACTGCACTCCAGCCTGGGCGACAG 

CGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAA 




WO 99/46281 PCT/US99/0S028 

FI GURE 33 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA48309 
xsubunit 1 of l, 67 aa, 1 stop 
><MW: 6931, pi: 7,47, NX(S/T): 0 

MGKGMVAMLILGLLLLALLLPVQVSSFVPLTSMPEATi^TTKPSNSALQPTAGLLVVLL^ 
LHLYH 



WO 99/4628 1 PCT/US99/05028 

FIGURE 34 

GCCGCGGCGAGAGCGCGCCCAGCCCCGCCGCGaa^CCGCGCGCCCftGGACGCCTCCTCCCG 
CTGCTGGCCCGGCCGGCGGCCCTGACTGCGCTGCTGCTGCTGCTGCTGGGCCATGGCGGCGG 
CGGGCGCTGGGGCGCCCGGGCCCAGGAGGCGGCGGCGGCGGCGGCGGACGGGCCCCCCGCGG 
CAGACGGCGAGGACGGACAGGACCCGCACAGCAAGCACCTGTACACGGCCGACATGTTCACG 
CACGGGATCCAGAGCGCCGCGCACTTCGTCATGTTCTTCGCGCCCTGGTGTGGACACTGCCA 
GCGGCTGCAGCCGACTTGGAATGACCTGGGAGACAAATACAACAGCATGGAAGATGCCAAAG 

TCTATGTGGCTAAAGTGGACTGCACGGCCCACTCCGACGTGTGCTCCGCCCAGGGGGTGCG^ 
G GAT AC C C C AC C T T AAAG C T T T T C AAG C CAG GC C AAGAAGC T G T GAAGTAC CAGGG T C C T C G 
GGAC T T C CAGAC AC T GGAAAAC TGGAT GC T GCAGACAC T GAAC GAGGAGC CAGT GACACCAG 

AGCCGGAAGTGGAACCGCCCAGTGCCCCCGAGCTCAAGCAAGGGCTGTATGAGCTCTCAGCA 
AGCAACTTTGAGCTGCACGTTGCACAAGGCGACCACTTTATCAAGTTCTTCGCTCCGTGGTG 
TGGTCACTGCAAAGCCCTGGCTCCAACCTGGGAGCAGCTGGCTCTGGGCCTTGAACATTCCG 
AAACTGTCAAGATTGGCAAGGTTGATTGTACACAGCACTATGAACTCTGCTCCGGAAACCAG 
GTTCGTGGCTATCCCACTCTTCTCTGGTTCCGAGATGGGAAAAAGGTGGATCAGTACAAGGG 
AAAGCGGGATTTGGAGTCACTGAGGGAGTACGTGGAGTCGCAGCTGCAGCGCACAGAGACTG 
GAGCGACGGAGACCGTCACGCCCTCAGAGGCCCCGGTGCTGGCAGCTGAGCCCGAGGCTGAC 
AAGGGCAC T G T G T T GGCAC T CACT GAAAAT AACT T C GATGAC AC CAT T GCAGAAGGAATAAC 

CTTCATCAAGTTTTATGCTCCATGGTGTGGTCATTGTAAGACTCTGGCTCCTACTTGGGAGG 
AACTCTCTAAAAAGGAATTCCCTGGTCTGGCGGGGGTCAAGATCGCCGAAGTAGACTGCACT 
GCTGAACGGAATATCTGCAGCAAGTATTCGGTACGAGGCTACCCCACGTTATTGCTTTTCCG 
AGGAGGGAAGAAAG T CAG T GAGCACAGT GGAGGC AGAGACC T T GACT C GT TACAC C GC TTT G 

TCCTGAGCCAAGCGAAAGACGAACTTT^SGAACACAGTTGGAGGTCACCTCTCCTGCCCAGC 

TCCCGCACCCTGCGTTTAGGAGTTCAGTCCCACAGAGGCCACTGGGTTCCCAGTGGTGGCTG 

TTCAGAAAGCAGAACATACTAAGCGTGAGGTATCTTCTTTGTGTGTGTGTTTTCCAAGCCAA 

CACACTCTACAGATTCTTTATTAAGTTAAGTTTCTCTAAGTAAATGTGTAACTCATGGTCAC 

TGTGTAAACATTTTCAGTGGCGATATATCCCCTTTGACCTTCTCTTGATGAAATTTACATGG 

TTTCCTTTGAGACTAAAATAGCGTTGAGGGAAATGAAATTGCTGGACTATTTGTGGCTCCTG 

AGTTGAGTGATTTTGGTGAAAGAAAGCACATCCAAAGCATAGTTTACCTGCCCACGAGTTCT 

GGAAAGGTGGCCTTGT'GGCAGTATTGACGTTCCTCTGATCTTAAGGTCACAGTTGACTCAAT 
AC TGTGTTGGTCCG TAG CAT GGAGC AGAT T GAAAT GCAAAAAC C C ACAC C T C T G GAAGATAC 

CTTCACGGCCGCTGCTGGAGCTTCTGTTGCTGTGAATACTTCTCTCAGTGTGAGAGGTTAGC 
CGTGATGAAAGCAGCGTTACTTCTGACCGTGCCTGAGTAAGAGAATGCTGATGCCATAACTT 
TATGTGTCGATACTTGTCAAATCAGTTACTGTTCAGGGGATCCTTCTGTTTCTCACGGGGTG 
AAACATGTCTTTAGTTCCTCATGTTAACACGAAGCCAGAGCCCACATGAACTGTTGGATGTC 
TTCCTTAGAAAGGGTAGGCATGGAAAATTCCACGAGGCTCATTCTCAGTATCTCATTAACTC 
ATTGAAAGATTCCAGTTGTATTTGTCACCTGGGGTGACAAGACCAGACAGGCTTTCCCAGGC 
CTGGGTATCCAGGGAGGCTCTGCAGCCCTGCTGAAGGGCCCTAACTAGAGTTCTAGAGTTTC 
TGATTCTGTTTCTCAGTAGTCCTTTTAGAGGCTTGCTATACTTGGTCTGCTTCAAGGAGGTC 
GACCTTCTAATGTATGAAGAATGGGATGCATTTGATCTCAAGACCAAAGACAGATGTCAGTG 
GGCTGCTCTGGCCCTGGTGTGCACGGCTGTGGCAGCTGTTGATGCCAGTGTCCTCTAACTCA 
TGCTGTCCTTGTGATTAAACACCTCTATCTCCCTTGGGAATAAGCACATACAGGCTTAAGCT 
C T AAGATAGAT AG GTGTTTGTCCTTT T AC CAT C GAGC TAC T T C C C ATAATAAC C AC T T T GCA 
T C C AAC AC T C T T C AC C C AC C T C C CAT AC G CAAGGGG AT G T GGAT AC T T GGC C CAAAG TAAC T 
GG T G G TAG GAA T C T TAGAAAC AAGAC CAC T T ATAC TGTCTGTCT GAG GC AGAAGATAACAGC 
AGCATCTCGACCAGCCTCTGCCTTAAAGGAAATCTTTATTAATCACGTATGGTTCACAGATA 
ATTCTTTTTTTAAAAAAACCCAACCTCCTAGAGTUIGCACAACTGTCAAGAGTCTTGTACACA 
CAACTTCAGCTTTGCATCACGAGTCTTGTATTCCAAGAAAATCAAAGTGGTACAATTTGTTT 
GTTTACACTATGATACTTTCTAAATAAACTCTTTTTTTTTAA 



2>4 1*21 
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FIGURE 35 

></usr/seqdb2/sst/DNA/Dnaseqs . oiin/ss . DNA46776 
xsubunit 1 of 1, 432 aa, i stop 

><MW: 47629, pi; 5,90, NX(S/T): 0 

MPARPGRLLPLLARPAALTALLLLLLGHGGGGRWGARAQEAAAAAADGPPAADGEDGQDPH^ 
XHL YT ADMFTKG I QSAAHFVMFFAPWCGHCQRLQPTWNDLGDKYNSMEDAKVYVAKVDCTAH 
SDVCSAQGVRGYPTLKLFKPGQEAVKYQGPRDFQ 

LKQGL YELS ASNFELHVAQGDHF IKFFAPWCGHCKALAPTWEQLALGLEHS ETVKI GKVDCT 

QHYELCSGNQVRGYPTLLWFRDGKKVDQYKGKRDLESLREYVESQLQRTETGATETVTPSEA 
PVLAAE P E AD KGT VL ALTENNFDDT I AEG I TF I KF YAPWCGHCKTLAPTWEELS KKEFPGLA 

GVKIAEVDCTAERNICSKYSVRGYPTLLLFRGGKKVSEHSGGRDLDSLHRFVLSQAKDEL 



351*2* 
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FIGURE 36 

CT T T T C T GAG G AAC C AC AG C AATQAAT GGC TT T GCAT CCTTGCTTC GAAGAAAC C AAT T TAT 
CCTCCTGG T AC TAT TTCTTTTG CAAAT T C AGAGT C T GGGT C T GGAT AT T GAT AGC C GT C CTA 
C C GC T GAAG T CT GTGC C AC AC AC AC AAT T T C AC C AGGAC C CAAAGGAGAT GAT GGT GAAAAA 
CGAGAT C C AGGAGAAGAG GGAAAGCAT GGCAAAG TGGGAC GCAT GGGGC C GAAAGGAATTAA 
AGGAGAACTGGGTGATATGGGAGATCAGGGCAATATTGGCAAGACTGGGCCCATTGGGAAGA 
AGGGTGACAAAGGGGAAAAAGGTTTGCTTGGAATACCTGGAGAAAAAGGCAAAGCAGGTACT 
GTCTGTGATTGTGGAAGATACCGGAAATTTGTTGGACAACTGGATATTAGTATTGCTCGGCT 
C AAG AC AT C TAT GAAG T T T G T CAAGAATGT GATAGC AGGGAT T AGGGAAAC T GAAGAG AAAT 
T C TAG T AC AT CG T G C AGG AAG AGAAGAAC TAC AGGGAAT C CC T AAC C CACT GC AGGAT T C GG 
GGT GGAATGCTAGC CAT GCCCAAGGATGAAGCTGCCAACACACT CATC GCTGAC TAT GTTGC 
C AAGAG T GGC TTCTTTCGGGTGTT CAT T GGC G T GAAT GAC C T T GAAAGGGAGGGACAG T ACA 
TGTCCACAGACAACACTCCACTGCAGAACTATAGCAA.CTGGAATGAGGGGGAACCCAGCGAC 
CCCTATGGTCATGAGGACTGTGTGGAGATGCTGAGCTCTGGCAGATGGAATGACACAGAGTG 
CCATCTTACCATGTACTTTGTCTGTGAGTTCATCAA.GAAGAAAAAGTMCTTCCCTCATCCT 
ACGTATTTGCTATTTTCCTGTGACCGTCATTACAGTTATTGTTATCCATCCTTTTTTTCCTG 
ATTGTACTACATTTGATCTGAGTCAACATAGCTAGAAAATGCTAAACTGAGGTATGGAGCCT 
C CAT CAT C AAAAAAAAAAAAAAAA 



WO 99/46281 PCTAJS99/05028 

FIGURE 37 

>< / usr / seqdh2 / sst /DNA/Dnaseqs . min/ss . DNA5 0 980 
xsubunit I of 1, 277 aa, 1 stop 
><MW: 30645, pi: 7,47, NX(S/T): 2 

MNGFASLLHRNQFILLVLFLLQIQSLGLDIDSRPTAEVCATHTISPGPKGDDGEKGDPGEEG 
KHCSKVGRMGPKGIKGELGDMGDQGNIGKTGPIGKKGDKGEXGLLGIPGEKGKAGTVCDCGRY 
RKFVGQLD I S I ARLKTSMKFVK3WIAGIRETEEKFYYIVQEEKNYRESLTHCRIRGGMLAMP 
KDEAANTLIADYVAKSGFFRVFIGVNDLEREGQYMSTDlSrrPLQISrYS 
VEMLS S GRWNDTE CHLTMYFVCE F I XKKK 



WO 99/46281 



PCTAJS99/Q502S 



GGTTCTATCGATTCGAATTCGGCCACACTQGCCGGATCCTCTAGAGATCCCTCGACCTCGAC 
CCACGCGTCCGCTGCTCTCCGCCCGTGTGGAGTGGTGGGGGCCTGGGTGGGA&ISGGCGTGT 
GCCAGCGCACGCGCGCTCCCTGGAAGGAGAAGTCTCAGCTAGAACGAGCGGCCCTAGGTTTT 
CGGAAGGGAGGATCAGGGATGTTTGCGAGCGGCTGGAACCAGACGGTGCCGATAGAGGAAGC 
GGGCTCCATGGCTGCCCTCCTGCTGCTGCCCCTGCTGCTGTTGCTACCGCTGCTGCTGCTGA 
AGCTACACCTCTGGCCGCAGTTGCGCTGGCTTCCGGCGGACTTGGCCTTTGCGGTGCGAGCT 
CTGTGCTGCAAAAGGGCTCTTCGAGCTCGCGCCCTGGCCGCGGCTGCCGCCGACCCGGAAGG 
TCCCGAGGGGGGCTGCAGCCTGGCCTGGCGCCTCGCGGAACTGGCCCAGCAGCGCGCCGCGC 
ACAC CTTTCT CATT CACGGCTCGCGGCGCTTTAGCTACTCAGAGGCGGAGCGCGAGAGTAAC 
AGGGCTGCACGCGCCTTCCTACGTGCGCTAGGCTGGGACTGGGGACCCGACGGCGGCGACAG 
GGGCGAGGGGAGCGCTGGAGAAGGCGAGCGGGCAGCGCCGGGAGCCGGAGATGCAGCGGCCG 
GAAGCGGCGCGGAGTTTGCCGGAGGGGACGGTGCCGCCAGAGGTGGAGGAGCCGCCGCCCCT 
CTGTCACCTGGAGCAACTGTGGCGCTGCTCCTCCCCGCTGGCCCAGAGTTTCTGTGGCTCTG 
GTTCGGGCTGGCCAAGGCCGGCCTGCGCACTGCCTTTGTGCCCACCGCCCTGCGCCGGGGCC 
CCCTGCTGCACTGCCTCCGCAGCTGCGGCGCGCGCGCGCTGGTGCTGGCGCCAGAGTTTCTG 
GAGTCCCTGGAGCCGGACCTGCCCGCCCTGAGAGCCATGGGGCTCCACCTGTGGGCTGCAGG 
CCCAGGAACCCACCCTGCTGGAATTAGCGATTTGCTGGCTGAAGTGTCCGCTGAAGTGGATG 
GGCCAGTG C CAGGAT ACCT CTCTTCCCC CCAGAGCAT AACAGACACGTGCCTGTACAT CTT C 
ACCTCTGGCACCACGGGCCTCCCCAAGGCTGCTCGGATCAGTCATCTGAAGATCCTGCAATG 
CCAGGGCTTCTATCAGCTGTGTGGTGTCCACCAGGAAGATGTGATCTACCTCGCCCTCCCAC 
TCTACCACATGTGCGGTTCCCTGCTGGGCATCGTGGGCTGCATGGGCATTGGGGCGACAGTG 
GTGCTGAAATCCAAGTTCTCGGCTGGTCAGTTCTGGGAAGATTGCCAGCAGCACAGGGTGAC 
GGTGTT C CAGT AC ATTGGGGAGCTGTGC CGATAC CTTGTCAACCAGCCCCCGAGCAAGG CAG 
AACGTGGCCATAAGGTCCGGCTGGCAGTGGGCAGCGGGCTGCGCCCAGATACCTGGGAGCGT 
TTTGTGCGGCG CTT CGGGC C CCTGCAGGTGCTGGAGACATATGGACTGACAGAGGGCAACGT 
GGCCACCATCAACTACACAGGACAGCGGGGCGCTGTGGGGCGTGCTTCCTGGCTTTACAAGC 
ATAT CTT CC C CTT CT C CTTGATTCGCTATGATGTCAC CACAGGAGAGCCAATT CGGGACCC C 
CAGGGGCACTGTATGGCCACATCTCCAGGTGAGCCAGGGCTGCTGGTGGCCCCGGTAAGCCA 
GCAGTCCCCATTCCTGGGCTATGCTGGCGGGCCAGAGCTGGCCCAGGGGAAGTTGCTAAAGG 
ATGTCTTCCGGCCTGGGGATGTTTTCTTCAACACTGGGGACCTGCTGGTCTGCGATGACCAA 
GGTTTT CT C CGCT T C C ATGAT CGTACTGGAGAC AC CTT CAGGTGGAAGGGGGAGAATGTGGC 
CACAAC CG AGGTGG C AGAGGTCTTCGAGGC C CTAGATTTTCTT CAGGAGGTGAACGTCTATG 
GAGTCACTGTGCCAGGGCATGAAGGCAGGGCTGGAATGGCAGCCCTAGTTCTGCGTCCCCCC 
CACGCTTTGGACCTTATGCAGCTCTACACCCACGTGTCTGAGAACTTGCCACCTTATGCCCG 
GCCC CGATTCCT CAGGCTC CAGGAGTCTTTGGCCACCACAGAGACCTTCAAACAGCAGAAAG 
TTCGGATGGCAAATGAGGGCTTCGACCCCAGCACCCTGTCTGACCCACTGTACGTTCTGGAC 
CAGGCTGTAGGTGCCTACCTGCCCCTCACAACTGCCCGGTACAGCGCCCTCCTGGCAGGAAA 
C CTT CGAAT C*£S&G AACTT C CACACCTGAGGCACCTGAGAGAGGAACTCTGTGGGGTGGGGG 
CCGTTGCAGGTGTACTGGGCTGTCAGGGATCTTTTCTATACCAGAACTGCGGTCACTATTTT 
GTAATAAATGTGGCTGGAGCTGATCCAGCTGTCTCTGACCTAAAAAAAAAAAAAAAAAA?U^A 
AAAAAAAAAGGG CGGC CGCGACT CTAGAGTCGAC CTGC AGTAGGGAT AAC AGGGTAAT AAGC 
TTGGCCGCCATGGCCCAACTTGTTTATTGCAG 
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FIGURE 39 



></usr/seqdb2/sst/DNA/Dnaseqs . min/ ss .DNA5 0913 
xsubunit 1 of 1, 730 aa, 1 stop 
xMW: 78644, pi: 7.65, NX{S/T): 2 
MGVCQRTRAPWKEKSQLERAALGFRXGGSGMFASGWNQTV^ 
LLLKLHLWPQLRWLPADLAFAVRMjCCKRAL 

RAAHTFLIHGSRRFSYSEAERESNRAARAFLRALGVTOWGPDGGDSGEGSAGEGERAAPGAGD 
AAAGSGAEFAGGDGAARGGGAAAPLSPGATVALLLPAGPEFLWLWFGLAKAGLRTAFVPTAL 
RRGPLLHCLRSCGARALVLAPEFLESLEPDLPALRAMGLHLWAAGPGTHPAGISDLLAEVSA 
EVDGPVPGYLSSPQSITDTCLYIFTSGTTGLPKAARISHLKILQCQGFYQLCGVHQEDVIYL 
ALPLYHMSGSLLGIVGCMGIGATVVLKSKFSAGQFWEDCQQHRVTVFQYIGELCRYLVNQPP 
S KAERGHKVRL AVG SGLR P DTWERFVRRFGPLQVLETYGLTE GNVAT I NYTGQRGAVGRASW 
LYKHI FPFSL IRYDVTTGEP IRDPQGHCMATSPGEPGLLVAPVSQQSPFLGYAGGPELAQGK 
LLKDVFRPGDVFFNTGDLLVCDDQGFLRFHDRTGDTFRWKGENVATTEVAEVFEALDFLQEV 
I^GVTVPGHEGRAGMAALVLRPPHALDLM 

QQKVRMANEGFDPSTLSDPLYVLDQAVGAYLPLTTARYSALLAGNLRI 

Signal peptide: 
aa 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation site 

starting at aa 136 

CUB domain protein motif 

aa 254-261 

putative AMP-binding domain siganture 

aa 332-343 

N-glycosylation sites 

aa 37-40 and 483-486 
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RGURE 40 



CCTGTGTTAAG CTGAGGTTT CC C CTAGATCTCGT ATATCC C CAACACATACCTCCACGC&CA 
CACAT C C C CAAGAA C CT C G AGCT CACACCAACAG ACACACG CGCGCATACACACTCGCTCTC 
GCTTGTCCATCTCCCTCCGGGGGGAGCCGGCGCGCGCTCCCACCTTTGCCGCACACTCCGGC 
GAG C C G AGC C CG CAG CG CT C CAGGATTCTGCGGCT CGGAACTCGGATTGCAGCTCTGAACC C 
CCATGGTGGTTTTTTAAACACTTCTTTTCCTTCTCTTCCTCGTTTTGATTGCACCGTTTCCA 
TCT GGGGG CT AGAGG AG CAAGG CAGCAGCCTTCCC AG C C AGCCCTTGTTGGCTTGC C ATCGT 
CCATCTGGCTTATAAAAGTTTGCTGAGCGCAGTCCAGAGGGCTGCGCTGCTCGTCCCCTCGG 
CTGGCAGAAGGGGGTGACGCTGGGCAGCGGCGAGGAGCGCGCCGCTGCCTCTGGCGGGCTTT 
CGGCTTGAGGGGCAAGGTGAAGAGCGCACCGGCCGTGGGGTTTACCGAGCTGGATTTGTATG 
TTGCACCAISCCTTCTTGGATCGGGGCTGTGATTCTTCCCCTCTTGGGGCTGCTGCTCTCCC 
TCCCCGCCGGGGCGGATGTGAAGGCTCGGAGCTGCGGAGAGGTCCGCCAGGCGTACGGTGCC 
AAGGGATT C AGC CT GG C GG ACAT CC C CTACCAGG AGATCGCAGGGGAACACTTAAGAAT CTG 
T CCT C AGGAAT AT AC ATG CT GCACCACAGAAATGGAAGACAAGTTAAGC CAACAAAGCAAAC 
TCGAATTTGAAAACCTTGTGGAAGAGACAAGCCATTTTGTGCGCACCACTTTTGTGTCCAGG 
CATAAGAAATTTGACGAATTTTTCCGAGAGCTCCTGGAGAATGCAGAAAAGTCACTAAATGA 
TATGTTTGTACGGACCTATGGCATGCTGTACATGCAGAATTCAGAAGTCTTCCAGGACCTCT 
T CAC AG AGCT GAAAAGGTACTACACTGGGGGTAATGTGAAT CTGGAGGAAATGCTCAATGAC 
TTTT GGGCT C GG CT C CT GGAACGGATGTTTCAGCTGATAAACC CTCAGTATCACTT C AGTGA 
AGACTACCT GGAAT GTGT GAGCAAATACACTGACCAG CTCAAGCCATTTGGAGACGTG CCCC 
GGAAACTGAAGATTCAGGTTACCCGCGCCTTCATTGCTGCCAGGACCTTTGTCCAGGGGCTG 
ACTGTGGGCAGAGAAGTTGCAAACCGAGTTTCCAAGGTCAGCCCAACCCCAGGGTGTATCCG 
TGCCCTCATGAAGATGCTGTACTGCCCATACTGTCGGGGGCTTCCCACTGTGAGGCCCTGCA 
ACAACTACTGTCTCAACGTCATGAAGGGCTGCTTGGCAAATCAGGCTGACCTCGACACAGAG 
TGGAATCTGTTTATAGATGCAATGCTCTTGGTGGCAGAGCGACTGGAGGGGCCATTCAACAT 
TGAGTCGGT CATGGACC CGATAGATGTCAAGATTTCTGAAGCCATTATGAACATGCAAGAAA 
ACAGCATGCAGGTGTCTGCAAAGGTCTTTCAGGGATGTGGTCAGCCCAAACCTGCTCCAGCC 
CTCAGATCTGCCCGCTCAGCTCCTGAAAATTTTAATACACGTTTCAGGCCCTACAATCCTGA 
GGAAAGAC CAAGAACT G CTGCAGGCACAAGCTTGGACCGG CTGGT CACAGACATAAAAGAGA 
AATT GAAGCT CT CT AAAAAGGTCTGGT CAGCATTACC CTACACTATCTGCAAGGACGAGAGC 
GTGACAGCGGGCACGTCCAACGAGGAGGAATGCTGGAACGGGCACAGCAAAGCCAGATACTT 
GCCTGAGATCATGAATGATGGGCTCACCAACCAGATCAACAATCCCGAGGTGGATGTGGACA 
TCACT CGG C CTG AC ACTTTC AT CAGACAGCAGATTATGG CT CTCCGTGTGATGAC CAACAAA 
CTAAAAAACG C CTAC AATGG CAATGATGT CAATTTC CAGGACACAAGTGATGAAT C CAGTGG 
CTCAGGGAGTGGCAGTGGGTGCATGGATGACGTGTGTCCCACGGAGTTTGAGTTTGTCACCA 
CAGAGG CC C C C G CAGTGGAT C C CGAC CGGAGAGAGGTGGACTCTTCTGCAGCC CAGCGTGG C 
CACTCCCTGCTCTCCTGGTCTCTCACCTGCATTGTCCTGGCACTGCAGAGACTGTGCAGA2A 
ATCTTGGGTTTTTGGTCAGATGAAACTGCATTTTAGCTATCTGAATGGCCAACTCACTTCTT 
TTCTTACACT CTTGGACAAT GGACCATGCCACAAAAACTTAC CGTTTTCTATGAGAAGAGAG 
CAGTAATGCAAT CTGC CT CC CTTTTTGTTTTCCCAAAGAGTACCGGGTGCCAGACTGAACTG 
CTTC CT CTTT C CTT CAG CT AT CTGTGGGGAC CTTGTTTATT CTAGAGAGAATT CTTACT CAA 
ATTTTT CGTAC C AGG AG ATTTT CTTACCTTCATTTG CTTTT ATG CTGC AGAAGTAAAGGAAT 
CTCACGTTGTGAGGGTTTTTTTTTTCTCATTTAAAAT 
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FIG U RE 41 

x /usr /seqdb2 / sat /DNA/Dnaseqs , min/ ss . DNA50914 
xsubunit 1 of l, 555 aa, 1 stop 
xMW: 62736, pi; 5,36, NX(S/T): 0 

MPSWIGAVILPLLGLLLSLPAGADVKARSCGEVRQAYGAKGFSLADIPYQEIAGEHLRICPQ 
EYTCCTTEMEDKLSQQSKLEFENLVEETSHFWTTFVSRHKKFDEFFRELLENAEKSLNDMF 
WTYGMLYMQNSEVFQDLFTELKRYYTGGNWLEEMI^FWARLLERMFQLINPQYHFSED^ 
LE CVS K YTDQ L KP F GD VPRKLKI Q VTRAF I AARTF VQGLTVGRE VANR VS KVS PTFG C IRAL 
MKMLYCPYCRGLPTVRPCNlTYCLJ^rVM 

VMDPIDVKISEAIMNMQENSMQVSAKVFQGCGQPKPAPALRSARSAPENFNTRFRPYNPEER 
PTTAAGTSLDRLVTD IKEKLKLSKKVWSALPYTI CKDES VTAGTSNEEE CWNGHSKARYLPE 
I MNDGLTNQ I NNP EVDVD I TRPDTF I RQQ IMALRVMTNKLKNAYNGNDVNFQDTSDES SGSG 
SGSGCMDDVCPTEFEFVTTEAPAVDPDRREVDSSAAQRGHSLLSWSLTCIVLALQRLCR 



4A 1 22=h 
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FIG URE 42 A 



CGGACGCGTGGGCGGACGCGTGGGCAAAAGAACTCGGAGTGCCAAAGCTAAATAAGTTAGCT 

GAGAAAACGCACGCAGTTTGCAGCGCCTGCGCCGGGTGCGCCAACTACGCAAAGACCAAGCG 

GGCTCCGCGCGGACCGGCCGCGGGGCTAGGGACCCGGCTTTGGCCTTCAGGCTCCCTAGCAG 

CGGGGAAAAGGAATTGCTGCCCGGAGTTTCTGCGGAGGTGGAGGGAGATCAGGAAACGGCTT 

CTT CCT C ACTT CG C CG C CTGGTGAGTGTCGGGG AGATTGGCAAACGCCTAGGAAAGGACTGG 

GGAAAATAG C C CTGGGAAAGTGGAGAAGGTGAT CAGGAGGC CGGTC CACTACGGCAGTTT AT 

CTGTCTGATCAGAGCCAGACGCGACGCGTCCACTTCGCAGTTCTTTCCAGGTGTGGGGACCG 

CAGGACAGACGG C CG AT C C CGCCGCC CTC CGTACCAGCACT CC CAGGAGAGTCAG CCTCGCT 

CCCCAACGTCGAGGGCGCTCTGGCCACGAAAAGTTCCTGTCCACTGTGATTCTCAATTCCTT 

G CTTGGTTTTTTT CT C C AG AGAACTTTTGGGTGGAGATATTAACTTTTTTCTTTTTTTTTTT 

C CTTGGTGGAAGCTG C T CT AGGGAGGGGGGAGG AGGAGGAGAAAGTGAAATGTGCTGG AGAA 

GAGCGAGCCCTCCTTGTTCTTCCGGAGTCCCATCCATTAAGCCATCACTTCTGGAAGATTAA 

AGTTGT C GGACATGGTGAC AGCTGAGAGGAGAGGAGGATTT CTTGCCAGGTGGAGAGTCTTC 

ACCGTCTGTTGGGTGCATGTGTGCGCCCGCAGCGGCGCGGGGCGCGTGGTTCTCCGCGTGGA 

GTCTCACCTGGGACCTGAGTGAA2SGCTCCCAGGGGCTGTGCGGGGCATCCGCCTCCGCCTT 

CTCCACAGGCCTGTGTCTGTCCTGGAAAGATGCTAGCAATGGGGGCGCTGGCAGGATTCTGG 

AT C CT CTGC CT C CT CACTTATGGTTACCTGTCCTGGGGCCAGGCCTTAGAAGAGGAGGAAGA 

AGGGGCCTTACTAG CT CAAGCTGGAGAGAAACTAGAGC C CAGCACAACTTCCAC CTCCCAGC 

C CC ATCTCATTTT CAT C CTAGCGGATGAT CAGGGATTTAGAGATGTGGGTTACCACGGAT CT 

GAGATTAAAACACCTACTCTTGACAAGCTCGCTGCCGAAGGAGTTAAACTGGAGAACTACTA 

TGTCCAGCCTATTTGCACACCATCCAGGAGTCAGTTTATTACTGGAAAGTATCAGATACACA 

C CGGACTT C AACATT CT AT C ATAAGACCTACCCAAC C CAACTGTTTACCTCTGGACAATGCC 

AC C CTAC CT CAGAAACT GAAGGAGGTTGGATATTCAACGCATATGGTCGGAAAATGGCACTT 

GGGTTTTAACAGAAAAGAATGCATGCCCACCAGAAGAGGATTTGATACCTTTTTTGGTTCCC 

TTTTGGGAAGTGGGGATTACTATACACACTACAAATGTGACAGTCCTGGGATGTGTGGCTAT 

GACTTGTATGAAAACGACAATGCTGCCTGGGACTATGACAATGGCATATACTCCACACAGAT 

GTACACTCAGAGAGTACAGCAAATCTTAGCTTCCCATAACCCCACAAAGCCTATATTTTTAT 

ATACTGCCTATCAAGCTGTTCATTCACCACTGCAAGCTCCTGGCAGGTATTTCGAACACTAC 

CGATC CATTAT CAACATAAACAGGAGAAGATATGCTGC CATGCTTTCCTGCTTAGATGAAGC 

AATCAACAACGTGACATTGGCTCTAAAGACTTATGGTTTCTATAACAACAGCATTATCATTT 

ACTCTT C AGATAATGGT GGCCAGCCTACGGC AGGAGGGAGTAACTGGC CT CTCAGAGGTAGC 

AAAGGAACATATTGGGAAGGAGGGATCCGGGCTGTAGGCTTTGTGCATAGCCCACTTCTGAA 

AAACAAGGGAACAGTGTGTAAGGAACTTGTGCACATCACTGACTGGTACCCCACTCTCATTT 

CACTGGCTGAAGGACAGATTGATGAGGACATTCAACTAGATGGCTATGATATCTGGGAGACC 

ATAAGTGAGGGTCTT CGCT CAC C CCGAGTAGATATTTTGCATAACATTGACC C CTATACACC 

AAGGCAAAAAATGG CT C CTGGGCAGCAGGCTATGGGATCTGGAACACTGCAAT C CAGTCAGC 

CATCAGAGTG C AG C ACTGGAAATTGCTTACAGGAAATC CTGGCT ACAGCGACTGGGTCCCC C 

CT C AGT CTTT C AGCAAC CTGGGACCGAAC CGGTGGCACAATGAACGG ATCACCTTGTCAACT 

GGCAAAAGTGTATGGCTTTTCAACATCACAGCCGACCCATATGAGAGGGTGGACCTATCTAA 

CAGGTATC CAGGAAT C GTS&AGAAGCTCCTACGGAGGCTCT CACAGTTCAACAAAACTGCAG 

TG CCGGTCAGGTAT C C C C C CAAAGACCCCAGAAGTAAC C CTAGGCTCAATGGAGGGGTCTGG 

GGACCATGGTATAAAGAGGAAACCAAGAAAAAGAAGCCAAGCAAAAATCAGGCTGAGAAAAA 

GGAAAAGAAAAGCAAAAAAAAGAAGAAGAAACAGCAGAAAGCAGTCTCAGGTAAACCAGCAA 

ATTTGGCTCGATAATATCGCTGGCCTAAGCGTCAGGCTTGTTTTCATGCTGTGCCACTCCAG 

AGACTT CTGC CAC CTGGCCGCCACACTGAAAACTGTCCTGCTCAGTGCCAAGGTGCTACTCT 

TGCAAGCCACACTTAGAGAGAGTGGAGATGTTTATTTCTCTCGCTCCTTTAGAAAACGTGGT 

GAGTCCTGAGTTCCACTGCTGTGCTTCAGTCAACTGACCAAACACTGCTTTGAATTATAGGA 

GGAGAACAATAACCTACCATCCGCAAGCATGCTAATTTGATGGA?VGTTACAGGGTAGCATGA 

TTAAAACTACCTTTGATAAATTACAGTCAAAGATTGTGTCACCTCAAAGGCCTTGAAGAATA 

TATTTTCTTGGTGAATTTTTGTATGTCTGTCATATGACACTTGGGTTTTTTAATTAATTCTA 

TTTTATATATATAAATATATGTTTCTTTTCCTGTGAAAAGCTGTTTTTCTCACATGTGAACA 

GCTTGCACCTCATTTTACCATGCGTGAGGGAATGGCAAATAAGAATGTTTGAGCACACTGCC 

CACAATGAATGTAACTATTTTCTAAACACTTTACTAGAAGAACATTTCAGTATAAAAAACCT 

AATTTATTTTTACAGAAAAATATTTTGTTGTTTTTATAAAAAGTTATGCAAATGACTTTTAT 

TTTTATTTC CTG CATAC CATTAGAAGAATTTTATTTCATTTCTTCAAATTATCAAGCACTGT 

AATACTATAAATTAATGTAATACTGTGTGAATTCAGACTATAAAAAACATCATTCAGAA^ 

TTTATAATCGTCATTGTTCAATCAAGATTTTGAATGTAATAAGATGAATATATTCCTTACAA 
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ATT ACTTGG AAATT CAATGTTTGTGCAGAGTTGAGACAACTTTATTGTTT CTAT CATAAACT 
ATTTATGTATCTTAATTATTAAAATGATTTACTTTATGGCACTAGAAAATTTACTGTGGCTT 
TTCTGATCTAACTTCTAGCTAAAATTGTATCATTGGTCCTAAAAAATAAAAATCTTTACTAA 
TAGGCAATTGAAGGAATGGTTTGCTAACAACCACAGTAATATAATATGATTTTACAGATAGA 
TGCTTCCCCTTGGCTATGACATGGAGAAAGATTTTCCCATAATAATAACTAATATTTATATT 
AGGTTGGTG C AAAACT AGTTGCGGTTTTTCC C ATTAAAAGTAATAAC CTTACTCTTATACAA 

AGTGGACACTGTGGGGAGAT ACAGAGAAATGGAAGATACGGATCCTGC CTGGAGTAGGTAAC 
CTTGCTT GGAAAC C C C ACATGCAAACGT CATGAGGAGAATTAAAGGAGTATT AT CAGTAATG 
AAGTTTAT CATGGGT CATCAATGAGCATAGATTGGTGTGGAT C CTGTAGAC CCTGGTGTTTT 
GTTTGAAGTG C C CT CTC CT AATGCAGAGG C CTTGAAGCTTACAGTATACACTTGAAAAGTCA 
CAGATAGCTAGAATTATGATCTTTGAAGTTATAACTGTGATCTGAAAATGTGTGTGGTGGTA 
TGACAGC ATA C CATTAAATACATTTACAT CACAGCTCAAAGGACTGTGATATAATC CATTTA 
TATC ACAA CT C AAAGGACTGTGATATAATCCATTTATATCACAGCT CACAGTTTCTGAAAAT 
GTATAAAAGAATCTATAATCTAGTACTGAAATTACTAAATTGGGTAAGATGATTTAAATGAT 
TTTAATTTTAACATTTTATTTCTAGAATATATGGCTCCATTTTATTTTATAGTGTAAAGTTG 
TATTTCCTAAAGTTTGTGTTTTGTCGACAGTATCTTTTAAATGAGTCTTAAAAATAAAGGCA 
TATTGTTCATGTTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIG UR E 43 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA48296 
xsubunit 1 of 1, 515 aa, 1 stop 
XMW: 56885, pi: 6.49, NX(S/T): 5 
MAPRGCAGHPPPPSPQACVCPGKMLAMGAIAGFWILCLL^ 
GEKLEPSTTSTSQPHLIFILADDQGFRDVGYHGSEIOT 

5RSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKEG 
MPTRRGFDTFFGSLLGSGDYYTHYKCDSPGMGGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 
I LASHNPTKP I FL YTAYQAVHS PLQAPGRYFEHYRS I ININRRRYAAMLS CLDEAINNVTLA 
LKTYGFYNNS 1 1 1 YSSDNGGQPTAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCK 
ELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS PRVDILHNIDPYTPRQKMAPG 

QQAMGSGTLQSSQPSECSTGNCLQEIIATATGSPLSLSATWDRTGGTMNGSPCQLAKVYGFS 
TSQPTHMRGWTYLTGIQES 

Important Features : 
Signal Peptide: 

amino acids 1-37 

Sulf atases signature 1 . 

amino acids 120-132 

Sulf atases signature 2 . 

amino acids 168-177 

Tyrosine kinase phosphorylation site, 
amino acids 163-169 

N-glycosylation sites. 

amino acids 157-160, 306.-309 and 318-321 
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WO 99/46281 



PCT/US99/0502S 



FIGURE 44 

CGGACGCGTGGGTGCGAGTGGAGCGGAGGACCCGAGCGGCTGAGGAGAGAGGAGGCGGCGGC 
TTAGCTGCTACGGGGTCCGGCCGGCGCCCTCCCGAGGGGGGCTCAGGAGGAGGAAGGAGGAC 
CCGTGCGAGAA2£CCTCTGCCCTGGAGCCTTGCGCTCCCGCTGCTGCTCTCCTGGGTGGCAG 
GTGGTTTCGGGAACGCGGCCAGTGCAAGGCATCACGGGTTGTTAGCATCGGCACGTCAGCCT 
GGGGTCTGTCACTATGGAACTAAACTGGCCTGCTGCTACGGCTGGAGAAGAAACAGCAAGGG 
AGTCTGTGAAGCTACATGCGAACCTGGATGTAAGTTTGGTGAGTGCGTGGGACCAAACAAAT 
GCAGATGCTTTCCAGGATACACCGGGAAAACCTGCAGTCAAGATGTGAATGAGTGTGGAATG 
AAACCCCGGCCATGCCAACACAGATGTGTGAATACACACGGAAGCTACAAGTGCTTTTGCCT 
CAGTGGCCACATGCTCATGCCAGATGCTACGTGTGTGAACTCTAGGACATGTGCCATGATAA 
ACTGTCAGTACAGCTGTGAAGACACAGAAGAAGGGCCACAGTGCCTGTGTCCATCCTCAGGA 
CTCCGCCTGGCCCCAAATGGAAGAGACTGTCTAGATATTGATGAATGTGCCTCTGGTAAAGT 
CATCTGTCCCTACAATCGAAGATGTGTGAACACATTTGGAAGCTACTACTGCAAATGTCACA 
TTGGTTTCGAACTGCAATATATCAGTGGACGATATGACTGTATAGATATAAATGAATGTACT 
ATGGATAGCCATACGTGCAGCCACCATGCCAATTGCTTCAATACCCAAGGGTCCTTCAAGTG 
TAAATGCAAGCAGGGATATAAAGGCAATGGACTTCGGTGTTCTGCTATCCCTGAAAATTCTG 
TGAAGGAAGTCCTCAGAGCACCTGGTACCATCAAAGACAGAATCAAGAAGTTGCTTGCTCAC 
AAAAACAGC AT GAAAAAGAAGGCAAAAAT T AAAAAT GT T ACCCCAGAACCCACCAGGACTC C 
TACCCCTAAGGTGAACTTGCAGCCCTTCAACTATGAAGAGATAGTTTCCAGAGGCGGGAACT 
CTCATGGAGGTAAAAAAGGGAATGAAGAGAAA.TSAAAGAGGGGCTTGAGGATGAGAAAAGAG 
AAGAGAAAGCCCTGAAGAATGACATAGAGGAGCGAAGCCTGCGAGGAGATGTGTTTTTCCCT 
AAGGTGAATGAAGCAGGTGAATTCGGCCTGATTCTGGTCCAAAGGAAAGCGCTAACTTCCAA 
ACTGGAACATAAAGATTTAAATATCTCGGTTGACTGCAGCTTCAATCATGGGATCTGTGACT 
GGAAACAGGATAGAGAAGATGATTTTGACTGGAATCCTGCTGATCGAGATAATGCTATTGGC 
TTCTATATGGCAGTTCCGGCCTTGGCAGGTCACAAGAAAGACATTGGCCGATTGAAACTTCT 
CCTACCTGACCTGCAACCCCAAAGCAACTTCTGTTTGCTCTTTGATTACCGGCTGGCCGGAG 
ACAAAGTCGGGAAACTTCGAGTGTTTGTGAAAAACAGTAACAATGCCCTGGCATGGGAGAAG 
ACCACGAGTGAGGATGAAAAGTGGAAGACAGGGAAAATTCAGTTGTATCAAGGAACTGATGC 
TACCAAAAGCATCATTTTTGAAGCAGAACGTGGCAAGGGCAAAACCGGCGAAATCGCAGTGG 
ATGGCGTCTTGCTTGTTTCAGGCTTATGTCCAGATAGCCTTTTATCTGTGGATGACTGAATG 
TTACTATCTTTATATTTGACTTTGTATGTCAGTTCCCTGGTTTTTTTGATATTGCATCATAG 
GACCTCTGGCATTTTAGAATTACTAGCTGAAAAATTGTAATGTACCAACAGAAATATTATTG 
TAAGATGCCTTTCTTGTATAAGATATGCCAATATTTGCTTTAAATATCATATCACTGTATCT 
T CT C AG T CAT T T C T GAAT C T TT C CNCAT T ATAT T AT AAAATNT GGAAANGT CAGT T TAT CT C 
CCCTCCTCNGTATATCTGATTTGTATANGTANGTTGATGNGCTTCTCTCTACAACATTTCTA 
GAAAATAGAAAAAAAAGCACAGAGAAATGTTTAACTGTTTGACTCTTATGATACTTCTTGGA 
AACTATGACATCAAAGATAGACTTTTGCCTAAGTGGCTTAGCTGGGTCTTTCATAGCCAAAC 
TTGTATATTTAATTCTTTGTAATAATAA 
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FIGURE 45 

MPLPWSLALPLLLSWVAGGFGNAASARHHG 

ATCBPGCKFGECVGPWKCRCFFGYTGKTCSQDWECGMKPRPCQHRCVNTHGSYKCFCLSGH 

MLMPDATCVNSRTCAMINCQYSCEDTEEGPQCLCPSSGLRLAPNGRDCLDIDECASGICVI CP 

YNRRCWTFGSYYCKCHIGFELQYISGRYDCIDIIffiCTMDSHTCSHHANCFNTQGSFKCKCK 

QGYKGNGLRCSAIPENSVKEVLRAPGTIKDRIKKLLAHKNSMKKKAKIKNVTPEPTRTP 
VNLQPFNYEE IVSRGGNSHGGKKGNEEK 

Signal peptide: 

amino acids 1-21 

EGF-like domain cysteine pattern signature. 

amino acids 80-91 

Calcium-binding EGF-like domains 

amino acids 103-124, 230-251 and 185-206 



Ail*®* 
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FIGURE 46 

GGGAGCTGCTGCTGTGGCTGCTGGTGCTGTGCGCGCTGCTCCTGCTCTTGGTGCAGCTGCTG 
CGCTTCCTGAGGGCTGACGGCGACCTGACGCTACTATGGGCCGAGTGGCAGGGACGACGCCC 
AGAATGGGAGCTGACTGATaSSGTGGTGTGGGTGACTGGAGCCTCGAGTGGAATTGGTGAGG 
AGCTGGCTTAC C AG TTGT CTAAACTAGGAGTTTCT CTTGTG CTGTCAGCCAGAAGAGTGCAT 

GAGCTGGAAAGGGTGAAAAGAAGATGCCTAGAGAATGGCAATTTAAAAGAAAAAGATATACT 
TGTTTTGCCCCTTGACCTGACCGACACTGGTTCCCATGAAGCGGCTACCAAAGCTGTTCTGC 
AGGAGTTTGGTAGAATCGACATTCTGGTCAACAATGGTGGAATGTCCCAGCGTTCTCTGTGC 
ATGGATACCAGCTTGGATGTCTACAGAAAGCTAATAGAGCTTAACTACTTAGGGACGGTGTC 
CTTGACAAAATGTGTT CTG C CT CACATGATCGAGAGGAAGCAAGGAAAGATTGTTACTGTGA 
ATAGCAT C CTGGGTAT C AT ATCTGTAC CTCTTTC CATTGGATACTGTGCTAGCAAGCATGCT 
CTCCGGGGTTTTTTTAATGGC CTTCGAACAGAACTTG C CACATAC CCAGGTATAATAGTTT C 
TAACATTTGC C CAGGAC CTGTGCAATCAAATATTGTGGAGAATTCCCTAGCTGGAGAAGTCA 
CAAAGACTATAGG C AAT AATGGAGACCAGTCC CACAAGATGACAAC CAGTCGTTGTGTGCGG 
CTGATGTTAATGAGCATGGCCAATGATTTGAAAGAAGTTTGGATCTCAGAACAACCTTTCTT 
GTTAGTAACATATTTGTGGCAATACATGCCAACCTGGGCCTGGTGGATAACCAACAAGATGG 
GGAAGAAAAGGATTGAGAACTTTAAGAGTGGTGTGGATGCAGACTCTTCTTATTTTAAAATC 
TTTAAGACAAAACAlJ2aCTGAA?^AGAGCACCTGTACTTTTCAAGCCACTGGAGGGAGAAATG 
GAAAAC ATGAAAAC AG CAAT CTT CTTATGCTTCTGAATAAT CAAAGACTAATTTGTGATTTT 

ACTTTTTAATAGATATGACTTTGCTTCCAACATGGAATGAAATAAAAAATAAATAATAAAAG 
ATTGCCATGAAT CTTG CAAAA 
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F IG UR E 47 



></us-r/seqdb2/sst/DNA/Dnaseqs .min/ss •DNA3 6343 
xsubunit 1 of 1, 28 9 aa, 1 stop 
><MW: 32268, pi: 9.21, NX'S/?): 0 

MVVWVTGAS SG I GEELA YQLSKLGVSLVLS ARRVHELERVKRRCLENGNLKEKD ILVLPLDL 
TDTGSHEAATKAVLQEFGR I D ILVNNGGMSQRSLCMDTSLDVYRKL I ELNYLGTVSLTKCVL 
PHMIERKQGKI VTVNS ILGI ISVPLS IGYCASKHALRGFFNGLRTEIATYPGI IVSNICPGP 
VQSNI VENSLAGEVTXTI GNNGDQSHKOT^ 
Q YMPTWAWW I TNKMGKKR I ENFKS GVDADS S YFK I FKTKHD 

Important Features: 
Signal Peptide : 

amino acids 1-31 

Transmembrane domain: 

amino acids 136-157 

Tyrosine kinase phosphorylation site. 
106-113 and 107-114 

Homologous region to Short -chain alcohol dehydrogenase 

amino acids 80-90, 131-168, 1-13 and 176-185 
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FIGU R E 48 



GCGACGTGGGCACCGCCATCAGCTGTTCGCGCGTCTTCTCCTCCAGGTGGGGCAGGGGTTTC 
GGGCTGGTGGAGCATGTGCTGGGACAGGACAGCATCCTCAA.TCAATCCAACAGCATATTCGG 
TTGCATCTTCTACACACTACAGCTATTGTTAGGTTGCCTGCGGACACGCTGGGCCTCTGTCC 
TGMSCTGCTGAGCTCCCTGGTGTCTCTCGCTGGTTCTGTCTACCTGGCCTGGATCCTGTTC 
TTCGTGCTCTATGATTTGTGCATTGTTTGTATCACCACCTATGCTATCAACGTGAGCCTGAT 
GTGGCTCAGTTTCCGGAAGGTCCAAGAACCCCAGGGCAAGGCTAAGAGGCACTGAGCCCTCA 
ACCCAAGCCAGGCTGACCTCATCTGCTTTGCTTTGGTCTTCAAGCCGCTCAGCGTGCCTGTG 
GACAGCGTGGCCCCGGCCCCCCCAAGCCTCAGGAGGGCAACACAGTCCCTGGCGAGTGGCCC 
TGGCAGGCCAGTGTGAGGAGGCAAGGAGCCCACATCTGCAGCGGCTCCCTGGTGGCAGACAC 
CTGGGTCCTCACTGCTGCCCACTGCTTTGAAAAGGCAGCAGCAACAGAACTGAATTCCTGGT 
CAGTGGTCCTGGGTTCTCTGCAGCGTGAGGGACTCAGCCCTGGGGCCGAAGAGGTGGGGGTG 
GCTGCCCTGCAGTTGCCCAGGGCCTATAACCACTACAGCCAGGGCTCAGACCTGGCCCTGCT 
GCAGCTCGCCCACCCCACGACCCACACACCCCTCTGCCTGCCCCAGCCCGCCCATCGCTTCC 
CCTTTGGAGCCTCCTGCTGGGCCACTGGCTGGGATCAGGACACCAGTGATGCTCCTGGGACC 
CTACGCAATCTGCGCCTGCGTCTCATCAGTCGCCCCACATGTAACTGTATCTACAACCAGCT 
GCACCAGCGACACCTGTCCAACCCGGCCCGGCCTGGGATGCTATGTGGGGGCCCCCAGCCTG 
GGGTGCAGGGCCCCTGTCAGGGAGATTCCGGGGGCCCTGTGCTGTGCCTCGAGCCTGACGGA 
CACTGGGTTCAGGCTGGCATCATCAGCTTTGCATCAAGCTGTGCCCAGGAGGACGCTCCTGT 
GCTGCTGACCAACACAGCTGCTCACAGTTCCTGGCTGCAGGCTCGAGTTCAGGGGGCAGCTT 
TCCTGGCC C AGAG C C C AGAG AC CC C GGAGAT GAGT GAT GAGGACAGC T GT GT AGC CT GT GGA 
TCCTTGAGGACAGCAGG'TCCCCAGGCAGGAGCACCCTCCCCATGGCCCTGGGAGGCCAGGCT 
GATGCACCAGGGACAGCTGGCCTGTGGCGGAGCCCTGGTGTCAGAGGAGGCGGTGCTAACTG 
CTGCCCACTGCTTCATTGGGCGCCAGGCCCCAGAGGAATGGAGCGTAGGGCTGGGGACCAGA 
CCGGAGGAGTGGGGCCTGAAGCAGCTCATCCTGCATGGAGCCTACACCCACCCTGAGGGGGG 
CTACGACATGGCCCTCCTGCTGCTGGCCCAGCCTGTGACACTGGGAGCCAGCCTGCGGCCCC 
TCTGCCTGCCCTATCCTGACCACCACCTGCCTGATGGGGAGCGTGGCTGGGTTCTGGGACGG 
GCCCGCCCAGGAGCAGGCATCAGCTCCCTCCAGACAGTGCCCGTGACCCTCCTGGGGCCTAG 
GGCCTGCAGCCGGCTGCATGCAGCTCCTGGGGGTGATGGCAGCCCTATTCTGCCGGGGATGG 
TGTGTACCAGTGCTGTGGGTGAGCTGCCCAGCTGTGAGGGCCTGTCTGGGGCACCACTGGTG 
CATGAGGTGAGGGGCACATGGTTCCTGGCCGGGCTGCACAGGTTCGGAGATGCTTGCCAAGG 
CCCCGCCAGGCCGGCGGTCTTCACCGCGCTCCCTGCCTATGAGGACTGGGTCAGCAGTTTGG 
ACTGGCAGGTCTACTTCGCCGAGGAACCAGAGCCCGAGGCTGAGCCTGGAAGCTGCCTGGCC 
AACAT AAG C C AAC C AAC C AG C T GC3E&CAGGGGAC C T GGC CAT T C T CAGGACAAGAGAAT G C 
AGGCAGGCAAATGGCATTACTGCCCCTGTCCTCCCCACCCTGTCATGTGTGATTCCAGGCAC 
CAGGGCAGGCCCAGAAGCCCAGCAGCTGTGGGAAGGAACCTGCCTGGGGCCACAGGTGCCCA 
CTCCCCACCCTGCAGGACAGGGGTGTCTGTGGACACTCCCACACCCAACTCTGCTACCAAGC 
AGGCGTCTCAGCTTTCCTCCTCCTTTACTCTTTCAGATACAATCACGCCAGCCACGTTGTTT 
TGAAAATTTCTTTTTTTGGGGGGCAGCAGTTTTCCTTTTTTTAAACTTAAATAAATTGTTAC 
AAAATAAAA 
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FIGURE 49 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA4 0571 
MLLSSLVSLAGSVYLAWILfFVLYDPCIVCITTYAIlTVSLMWL 

PGEWPWQASVRRQGAH I CS GSLVADTWVLTAAHCFEKAAATELNSWSWLGSLQREGLS PGA 
EEVGVAALQLPRAYNHYSQGSDIJVI^QliAHPTTOT 

DAPGTLRNLRLRLISRPTCNCIYNQLHQRHLSNPARPGMLCGGPQPGVQGPCQGDSGGPVLC 
LE PDGHWVQAG IIS FAS S CAQEDAP VLLTNTAAHS S WLQARVQGAAFLAQS PETPEMSDEDS 
CVACG S LRT AGP Q AG AP S P WPWEARLMHQGQLACGGAL VS E EAVLTAAHCF I GRQAP EEWS V 

GLGTRPEEWGLKQLILHGAYTHPEGGYDMALLLLAQPVTLGASLRPLCLPYPDHHLPDGERG 

WVLGRARPGAG I S S LQTVP VTLLGPRACSRLEAAPGGDGS P I LPGMVCTS AVGELPS CEGLS 

GAPLVHEVRGTWFLAGLHSFGDACQGPARPAVFTALPAYEDWVSSLDWQVYFAEEPEPEAEP 
GSCLANISQPTSC 

Important features: 
Signal peptide: 

amino acids 1-15 

Homologous region to Serine proteases, trypsin family 
amino acids 79-95, 343-359 and 237-247 

N-glycosylation sites. 

amino acids 3 7-40 and 564-567 

Kringle domains 

amino acids 79-96, 343-360 and 235-247 
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FIG UR E 5Q 



CGGGCCGCCCCCGGCCCCCATTCGGGCCGGGCCTCGCTGCGGCGGCGACTGAGCCAGGCTGG 



GCCGCGTCCCTGAGTCCCAGAGTCGGCGCGGCGCGGCAGGGGCAGCCTTCCACCACGGGGAG 
CCCAGCTGTCAGCCGCCTCACAGGAAGATSCTGCGTCGGCGGGGGAGCCCTGGCATGGGTGT 
GCATGTGGGTGCAGCCCTGGGAGCACTGTGGTTCTGCCTCACAGGAGCCCTGGAGGTCCAGG 
TCCCTGAAGACCCAGTGGTGGCACTGGTGGGCACCGATGCCACCCTGTGCTGCTCCTTCTCC 
C CTGAG C CT GGCTT GAG C CTGGCACAGCTCAAC CTCAT CTGGCAGCTGACAGATACCAAACA 
GCTGGTGCACAGCTTTGCTGAGGGCCAGGACCAGGGCAGCGCCTATGCCAACCGCACGGCCC 
TCTTCCCGGACCTGCTGGCACAGGGCAACGCATCCCTGAGGCTGCAGCGCGTGCGTGTGGCG 
GACGAGGGCAGCTTCACCTGCTTCGTGAGCATCCGGGATTTCGGCAGCGCTGCCGTCAGCCT 
GCAGGTGGCCGCTCCCTACTCGAAGCCCAGCATGACCCTGGAGCCCAACAAGGACCTGCGGC 
CAGGGGACACGGTGACCATCACGTGCTCCAGCTACCAGGGCTACCCTGAGGCTGAGGTGTTC 
TGGCAGGATGGGCAGGGTGTGCCCCTGACTGGCAACGTGACCACGTCGCAGATGGCCAACGA 
GCAGGGCTTGTTTGATGTGCACAGCGTCCTGCGGGTGGTGCTGGGTGCGAATGGCACCTACA 
GCTGCCTGGTGCGCAACCCCGTGCTGCAGCAGGATGCGCACRGCTCTGTCACCATCACAGGG 
CAGCCTATGACATTCCCCCCAGAGGCCCTGTGGGTGACCGTGGGGCTGTCTGTCTGTCTCAT 
TGCACTGCTGGTGGCCCTGGCTTTCGTGTGCTGGAGAAAGATCAAACAGAGCTGTGAGGAGG 
AGAAT G CAGGAGCTGAGGAC CAGGATGGGGAGGGAGAAGGCTCCAAGACAGCCCTGCAGCCT 
CTGAAAGACTCTGACAGCAAAGAAGATGATGGACAAGAAATAGCCISaCCATGAGGACCAGG 
GAGCTGCTACCCCTCCCTACAGCTCCTACCCTCTGGCTGCAATGGGGCTGCACTGTGAGCCC 
TGC C C C CAAC AGATGCAT C CTG CT CTG ACAGGTGGG CTC CTTCTCCAAAGGATGCGATACAC 
AGACCACTGTGCAGCCTTATTTCTCCAATGGACATGATTCCCAAGTCATCCTGCTGCCTTTT 
TT CTTAT AGACACAATGAAC AGACCACC CACAAC CTTAGTTCTCTAAGT CAT CCTGCCTGCT 
GCCTTATTTCACAGTACATACATTTCTTAGGGACACAGTACACTGACCACATCACCACCCTC 
TTCTTCCAGTGCTGCGTGGACCATCTGGCTGCCTTTTTTCTCCAAAAGATGCAATATTCAGA 
CTG ACTGAC C CCCTGC CTTATTTCAC CAAAGACACGATGCATAGTCACC C CGGCCTTGTTTC 
TCCAATGGCCGTGATACACTAGTGATCATGTTCAGCCCTGCTTCCACCTGCATAGAATCTTT 
TCTTCTCAGACAGGGACAGTGCGGCCTCAACATCTCCTGGAGTCTAGAAGCTGTTTCCTTTC 
CCCTCCTTCCTCCCTGCCCCAAGTGAAGACAGGGCAGGGCCAGGAATGCTTTGGGGACACCG 
AGGGGACTGCCCCCCACCCCCACCATGGTGCTATTCTGGGGCTGGGGCAGTCTTTTCCTGGC 
TTGC CT CTGGCCAGCTC CTGGCCT CTGGTAGAGTGAGACTT CAGACGTTCTGATGCCTT CCG 
GATGT CAT CT CT C CCTG C C C CAGGAATGGAAGATGTGAGGACTTCTAATTTAAATGTGGGAC 
TCGGAGGGATTTTGTAAACTGGGGGTATATTTTGGGGAAAATAAATGTCTTTGTAAAAAAAA 
AAAAAAAAAAAAAA 
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>< /usr /seqdb2 / sst /DNA/Dnaseqs . oiin/ ss . DNA4 1386 
xsubunit 1 of 1, 316 aa, 1 step, 1 unknown 
><MW: -1, pi: 4.62, NX(S/T): 4 

MLRRRGS PGMGVHVGAALGALWFCLTGALEVQVPED P WALVGTDATLCCSFS PEPGFSLAQ 
LNL I WQLTDTXQL VHS F AE GQDQGSAYAMITALFPDLLAQGNASLRLQRVRVADEGSFTCFV 

SIRDFGSAAVSLQVAAPYSKPSMTLEPNKDLRPGDTVTITCSSYQGYPEAEYFWQDGQGVPL 
TGNVTTSQMANEQGLFDVHSVLRVVLG^ 

LWVTVGLSVCLIALLVALAFVCWKIKQSCEEENAGAEDQDGEGEGSKTALQPLKHSDSKED 
DGQEIA 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 251-270 

N-glycosylation site* 

amino acids 91-94, 104-107, 189-192 and 215-218 

Homologous region to Immunoglobulins and MHC 

amino acids 217-234 



52/ Z&r 
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FIGURE 52 



TTCGTGACCCTTGAGAAAAGAGTTGGTGGTAAATGTGCCACGTCTTCTAAGAAGGGGGAGTC 
CTGAACTTGTCTGAAGCCCTTGTCCGTAAGCCTTGAACTACGTTCTTAAATCTATGAAGTCG 
AGGGAC CTTT CGCTG CTTTTGTAGGGACTTCTTTCCTTGCTTCAGCAACaiSAGGCTTTTCT 

TGTGGAACGCGGTCTTGACTCTGTTCGTCACTTCTTTGATTGGGGCTTTGATCCCTGAACCA 
GAAGTGAAAATTGAAGTTCTCCAGAAGCCATTCATCTGCCATCGCZAA 

TTTGATGTTGGTCCACTATGAAGGCTACTTAGAAAAGGACGGCTCCTTATTTCACTCCACTC 
ACAAACATAACAATGGTCAGCCCATTTGGTTTACCCTGGGCATCCTGGAGGCTCTCAAAGGT 
TGGGAC CAGGG CTTG AAAGG AATGTGTGTAGGAGAGAAGAGAAAGCTCATCATT CCT CCTGC 
TCTGGGCTATGGAAAAGAAGGAAAAGGTAAAATTCCCCCAGAAAGTACACTGATATTTAATA 
TTGAT CT C CTGGAG ATT CGAAATGGAC CAAGATCCGATGAATCATTCCAAGAAATGGATCTT 

AATGATGACTGGAAACTCTCTAAAGATGAGGTTAAAGCATATTTAAAGAAGGAGTTTGAAAA 

ACATGGTGCGGTGGTGAATGAAAGTCATCATGATGCTTTGGTGGAGGATATTTTTGATAAAG 

AAGATGAAGACAAAGATGGGTTTATATCTGCCAGAGAATTTACATATAAACACGATGAGTTA 

IASAGATACATCTACCCTTTTAATATAGCACTCATCTTTCAAGAGAGGGCAGTCATCTTTAA 

AGAACATTTTATTTTTATACAATGTTGTTTCTTGCTTTGTTTTTTATTTTTATATATTTTTT 

CTGACTCCTATTTAAAGAACCCCTTAGGTTTCTAAGTACCCATTTCTTTCTGATAAGTTATT 

GGG AAGAAAAAGCTAATTGGTCTTTGAATAGAAGACTTCTGGACAATTTTTCACTTTCACAG 

ATATGAAGCTTTGTTTTACTTTCTCACTTATAAATTTAAAATGTTGCAACTGGGAATATACC 

ACGACATGAGACCAGGTTATAGCACAAATTAGCACCCTATATTTCTGCTTCCCTCTATTTTC 

TCCAAGTTAGAGGTCAACATTTGAAAAGCCTTTTGCAATAGCCCAAGGCTTGCTATTTTCAT 

GTTATAATGAAATAGTTTATGTGTAACTGGCTCTGAGTCTCTGCTTGAGGACCAGAGGAAAA 

TGGTTGTTGGACCTGACTTGTTAATGGCTACTGCTTTACTAAGGAGATGTGCAATGCTGAAG 

TTAGAAACAAGGTTAATAGCCAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAG 

GCTGAGGCGGGCGGATCACCTGAGGTTGGGAGTTCGAGACCAGCCTGACCAACACGGAGAAA 

CCCTATCTCTACTAAAAATACAAAGTAGCCCGGCGTGGTGATGCGTGCCTGTAATCCCAGCT 

ACCCAGGAAGGCTGAGGCGGCAGAATCACTTGAACCCGAGGCCGAGGTTGCGGTAAGCCGAG 

AT CAC CTNC AG C CTGGAC ACTCTGTCT CGAAAAAAGAAAAGAACACGGTT AA.TAC CATATNA 

ATATGTATGC ATTGAGAC ATG CTAC CT AGGACTTAAGCTGATGAAGCTTGGCT C CTAGTGAT 

TGGTGG C CT ATTATG ATAAATAGGACAAAT CATTTATGTGTGAGTTTCTTTGTAATAAAATG 

TATCAATATGTTATAGATGAGGTAGAAAGTTATATTTATATTCAATATTTACTTCTTAAGGC 

TAGCGG AAT AT C CTT C CTGGTTCTTTAATGGGTAGTCTATAGTATATTATACTAC AATAACA 

TTGTAT CATAAGATAAAGTAGTAAAC CAGT CTACATTTTC CCATTT CTGTCT CATCAAAAAC 

TGAAGTTAGCTGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGGGCCAAGGAGGG 

TGGATCACTTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTCTA 

CTAAAAATACAAAAATTAGCCAGGCGTGGTGGTGCACACCTGTAGTCCCAGCTACTCGGGAG 
G CTGAG ACAGGAGATTT G CTTGAAC CCGGGAGGCGGAGGTTGCAGTGAGC CAAGATTGTGCC 

ACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAGAAGCAGA 
CCTACAGCAGCTACTATTGAATAAATACCTATCCTGGATTTT 




WO 99/46281 



PCTYUS99/G5028 




></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA44194 
xsubunit 1 of 1, 211 aa, 1 stop 
><MW: 24172, pi: 5.99, NX(S/T): 1 
MRLFLWNAVLTLFVTSLIGALIP 

FHS THKHNNGQ PIWFTLGI LE ALKGWDQGLKGMCVGEKRKL IIP PALGYGKEGKGKI PPEST 
L I FNIDLLE I RNGPRSHES FQEMDLNDDWKLSKDE VKAYLKKEFEKHGAWNESHHDALVED 
I FDKEDEDKDGFI SAREFT YKHDEL 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 176-179 

Casein kinase II phosphorylation site. 

amino acids 143-146, 156-159, 178-181 and 200-203 

Endoplasmic reticulum targeting sequence, 

amino acids 208-211 

FKBP-type pep tidyl -prolyl cis- trans isomerase 

amino acids 78-114 and 118-131 

EF-hand calcium -binding domain. 

amino acids 191-203, 184-203 and 140-159 

S-100/ICaBP type calcium binding domain 

amino acids 183-203 



WO 99/46281 



PCT/US99/05028 



FIGURE 54 



AATAAAG CTTC CTT AATGTTGTATATGTCTTTGAAGTACAT CCGTGCATTTTTTTTTAGCAT 
CCAACCATTCCTCCCTTGTAGTTCTCGCCCCCTCAAATCACCCTCTCCCGTAGCCCACCCGA 
CTAACATCTCAGTCTCTGAAAATSCACAGAGATGCCTGGCTACCTCGCCCTGCCTTCAGCCT 
CACGGGGCTCAGTCTCTTTTTCTCTTTGGTGCCAGCAGGACGGAGCATGGAGGTCACAGTAC 
CTGCCACCCTCAACGTCCTCAATGGCTCTGACGCCCGCCTGCCCTGCACCTTCAACTCCTGC 
TACACAGTGAACCACAAACAGTTCTCCCTGAACTGGACTTACCAGGAGTGCAACAACTGCTC 
TGAGGAGATG TT CCT C C AG TTCCGCATGAAGATCATTAACCTGAAG CTGGAGCGGTTTCAAG 
ACCGCGTGGAGTTCTCAGGGAACCCCAGCAAGTACGATGTGTCGGTGATGCTGAGAAACGTG 
CAGCCGGAGGATGAGGGGATTTACAACTGCTACATCATGAACCCCCCTGACCGCCACCGTGG 
CCATGGCAAGATCCATCTGCAGGTCCTCATGGAAGAGCCCCCTGAGCGGGACTCCACGGTGG 
CCGTGATTGTGGGTGCCTCCGTCGGGGGCTTCCTGGCTGTGGTCATCTTGGTGCTGATGGTG 
GTCAAGTGTGTGAGGAGAAAAAAAGAGCAGAAGCTGAGCACAGATGACCTGAAGACCGAGGA 
GGAGGGCAAGACGGACGGTGAAGGCAACCCGGATGATGGCGCCAAGiaSTGGGTGGCCGGCC 
CTGCAGCCTCCCGTGTCCCGTCTCCTCCCCTCTCCGCCCTGTACAGTGACCCTGCCTGCTCG 
CTCTTGGTGTGCTTCCCGTGACCTAGGACCCCAGGGCCCACCTGGGGCCTCCTGAACCCCCG 
ACTTCGTATCTCCCACCCTGCACCAAGAGTGACCCACTCTCTTCCATCCGAGAAACCTGCCA 
TGCTCTGGGACGTGTGGGCCCTGGGGAGAGGAGAGAAAGGGCTCCCACCTGCCAGTCCCTGG 
GGGGAGG CAGGAGG CACATGTGAGGGTCCCCAGAGAGAAGGGAGTGGGTGGGCAGGGGTAGA 
GGAGGGGCCGCTGTCACCTGCCCAGTGCTTGCCTGGCAGTGGCTTCAGAGAGGACCTGGTGG 
GGAGGGAGGGCTTTCCTGTGCTGACAGCGCTCCCTCAGGAGGGCCTTGGCCTGGCACGGCTG 
TGCTCCTCCCCTGCTCCCAGCCCAGAGCAGCCATCAGGCTGGAGGTGACGATGAGTTCCTGA 
AACTTGGAGGGGCATGTTAAAGGGATGACTGTGCATTCCAGGGCACTGACGGAAAGCCAGGG 
CTGCAGGCAAAGCTGGACATGTGCCCTGGCCCAGGAGGCCATGTTGGGCCCTCGTTTCCATT 
GCTAGTGGCCTCCTTGGGGCTCCTGTTGGCTCCTAATCCCTTAGGACTGTGGATGAGGCCAG 
ACTGGAAGAGCAGCTCCAGGTAGGGGGCCATGTTTCCCAGCGGGGACCCACCAACAGAGGCC 
AGTTTCAAAGTCAGCTGAGGGGCTGAGGGGTGGGGCTCCATGGTGAATGCAGGTTGCTGCAG 
GCTCTGCCTTCTCCATGGGGTAACCACCCTCGCCTGGGCAGGGGCAGCCAAGGCTGGGAAAT 
GAGGAGGCCATGCACAGGGTGGGGCAGCTTTCTTTGGGGCTTCAGTGAGAACTCTCCCAGTT 
GCCCTTGGTGGGGTTTCCACCTGGCTTTTGGCTACAGAGAGGGAAGGGAAAGCCTGAGGCCG 
GCATAAGGGGAGGCCTTGGAACCTGAGCTGCCAATGCCAGCCCTGTCCCATCTGCGGCCACG 
CTACTCGCTCCTCTCCCAACAACTCCCTTCGTGGGGACAAAAGTGACAATTGTAGGCCAGGC 
ACAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGGATTACCTCCAT 
CTGTTTAGTAGAAATGGGCAAAACCCCATCTCTACTAAAAATACAAGAATTAGCTGGGCGTG 
GTGGCGTGTGCCTGTAATCCCAGCTATTTGGGAGGCTGAGGCAGGAGAATCGCTTGAGCCCG 
GGAAGCAGAGGTTGCAGTGAACTGAGATAGTGATAGTGCCACTGCAATTCAGCCTGGGTGAC 
ATAGAGAGACTCCATCTCAAAAAAAA 




WO 99/46281 



PCT/US99/05028 



F IG URE 55 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss ,DNA45415 
<subunit l of 1, 215 aa, 1 stop 
<:MW: 24326, pi: 6-32, NX{S/T): 4 
MHRDAWLPRPAFSLTGLSLFFSLVPPGRSMEVWPAT^ 

FSLNWTYQECNNCSEEMFLQFRMKIINLKLERFQDRVEFSGNPSKYDVSVMLRNVQPEDEGI 

YNCYIMNPPDRHRGHGKIHLQVLMEEPPERDSWAVIV^^ 

KEQKLSTDDLKTEEEGKTDGEGNPDDGAK 

Important features: 
Signal peptide: 

amino acids 1-2 0 

Transmembrane domain: 

amino acids 161-179 

Immunoglobulin- like fold : 

amino acids 83-127 

N-glycosylation sites. 

amino acids 42-45, 66-69 and 74-77 




WO 99/46281 



PCT/US99/QS028 




GTTGTATATGTCCTGAAGTACATCCGTGCATTTTTTTTAGCATCCAACCATCCTCCCTTGTA 
GTTCTCGCCCCCTCAAATCACCTTCTCCCTTAGCCCACCCNACTAACATCTCAGTCTCTGAA 
AATGCACAGAGATGCCTGGCTACCTCGCCCTGCCTTCAGCCTCACGGGGCTCAGTCTCTTTT 
TCTCTTTGGTGCCACCAGGACGGAGCATGGAGGTCCACAGTACCTGNCCACCCTCAACGTCC 
TCAATGGCTCTGACGCCCGCCTGCCCTGCCCTTC^CTCCTGCTACACAGTGAACCAC^lAC 
AGTTCTCCCTGAACTGGACTTACCAGGAGTGCAACAACTGCTCTGAGGAGATGTTCCTCCAG 
TTCCGCATGAAGATCATTAACCTGAAGCTGGAGCGGTTTCAAGACCGCGTGGAGTTCTCAGG 
GAACCCCAGCAAGTACGATGTGTCGGTGATGCTGAGAAACGTGCAGCCGGAGGATGAGGGGA 
TTTACAACTGCTACATCATGAACCCCCC 



WO 99/46281 



PCT/US99/05028 



FIGURE 57 



TCACGGGGCTCATCTCTTTTTCTCTTTGGTGCCCACCAGGACGGAGCATGGAGGTNCACATA 



CCTGCCACCCTCAACGTCCTCAATGGCTTTGACGCCCGCCTGCCCTGCACCTTCAACTCCNG 
CTACACAGTGAACCACAAACAGTTCTCCCTGAACTGGATTTACCAGGAGTGCAACAACTGGC 
TCTGAGGAGATGTTCCTCCAGTTCCCGCATGGAAGATCATTTAACCTGAAAGCTGGAAGCGG 
TTTT CAAGAAC CGCGTGGAAGTTTCT CAGGGAACCCCAG CAAGTACGATGTGTCGGTGATGC 
TGAGAAACGTGCAGCCGGAGGATGAGGGGATTTACAACTGCTACATCATGAACCCCCC 




WO 99/46281 



PCT/US99/05028 



FIGURE 58 



TGCGGCGACCGTCGTACACCMSGGCCTCCACCTCCGCCCCTACCGTGTGGGGCTGCTCCCG 
GATGGCCTCCTGTTCCTCTTGCTGCTGCTAATGCTGCTCGCGGACCCAGCGCTCCCGGCCGG 
ACGTCACCCCCCAGTGGTGCTGGTCCCTGGTGATTTGGGTAACCAACTGGAAGCCAAGCTGG 
ACAAGC C G AC AG T G G T G C AC T AC C T C T GC T CCAAGAAGAC C GAAAGCTACT T CACAATCT GG 

CTGAACCTGGAACTGCTGCTGCCTGTCATCATTGACTGCTGGATTGACAATATCAGGCTGGT 
TTACAACAAAACATCCAGGGCCACCCAGTTTCCTGATGGTGTGGATGTACGTGTCCCTGGCT 
TTGGGAAGACCTTCTCACTGGAGTTCCTGGACCCCAGCAAAAGCAGCGTGGGTTCCTATTTC 
CACACCATGGTGGAGAGCCTTGTGGGCTGGGGCTACACACGGGGTGAGGATGTCCGAGGGGC 
TCCCTATGACTGGCGCCGAGCCCCAAATGAAAACGGGCCCTACTTCCTGGCCCTCCGCGAGA 
TGATCGAGGAGATGTACCAGCTGTATGGGGGCCCCGTGGTGCTGGTTGCCCACAGTATGGGC 
.AACATGTACACGCTCTACTTTCTGCAGCGGCAGCCGCAGGCCTGGAAGGACAAGTATATCCG 
GGCCTTCGTGTCACTGGGTGCGCCCTGGGGGGGCGTGGCCAAGACCCTGCGCGTCCTGGCTT 
CAGGAGACAACAACCGGATCCCAGTCATCGGGCCCCTGAAGATCCGGGAGCAGCAGCGGTCA 
GCTGTCTC CAC C AG CTGGCTGCTGCCC TACAACT AC ACAT GG TCAC C TGAGAAGG T G T T CG T 
GC AGACAC C C AC AAT C AAC TAC ACAC T GC GGGAC T ACCGCAAGT TCTT CCAGGACAT CGGCT 
T T GAAGAT GG C T G GC T C ATGCGGC AGGAC ACAGAAGGGC T GG T GGAAGCCACGAT GCCACC T 
GGCGTGCAGCTGCACTGCCTCTATGGTACTGGCGTCCCCACACCAGACTCCTTCTACTATGA 
GAGCTTCCCTGACCGTGACCCTAAAATCTGCTTTGGTGACGGCGATGGTACTGTGAACTTGA 
AGAGTGCCCTGCAGTGCCAGGCCTGGCAGAGCCGCCAGGAGCACCAAGTGTTGCTGCAGGAG 
C TGC CAGGC AG C GAG CAC AT C GAGAT GC T GGC CAAC G C CAC CAC C C TGGC CTAT C T GAAAC G 

TGTGCTCCTTGGGCCC1SECTCCTGTGCCACAGGACTCCTGTGGCTCGGCCGTGGACCTGCT 
GTTGGCCTCTGGGGCTGTCATGGCCCACGCGTTTTGCAAAGTTTGTGACTCACCATTCAAGG 
CCCCGAGTCTTGGACTGTGAAGCATCTGCCATGGGGAAGTGCTGTTTGTTATCCTTTCTCTG 
TGGCAGTGAAGAAGGAAGAAATGAGAGTCTAGACTCAAGGGACACTGGATGGCAAGAATGCT 
GCTGATGGTGGAACTGCTGTGACCTTAGGACTGGCTCCACAGGGTGGACTGGCTGGGCCCTG 
GTCCCAGTCCCTGCCTGGGGCCATGTGTCCCCCTATTCCTGTGGGCTTTTCATACTTGCCTA 
CTGGGCCCTGGCCCCGCAGCCTTCCTATGAGGGATGTTACTGGGCTGTGGTCCTGTACCCAG 
AGG T C C C AGGG AT CGGCTCCTGGCCCCTCG GGT GAC CCT T C C C ACACACCAGCCACAGATAG 
GCCTGCCACTGGTCATGGGTAGCTAGAGCTGCTGGCTTCCCTGTGGCTTAGCTGGTGGCCAG 
CCTGACTGGCTTCCTGGGCGAGCCTAGTAGCTCCTGCAGGCAGGGGCAGTTTGTTGCGTTCT 
TCGTGGTTCCCAGGCCCTGGGACATCTCACTCCACTCCTACCTCCCTTACCACCAGGAGCAT 
TCAAGCTCTGGATTGGGCAGCAGATGTGCCCCCAGTCCCGCAGGCTGTGTTCCAGGGGCCCT 
GATTTCCTCGGATGTGCTATTGGCCCCAGGACTGAAGCTGCCTCCCTTCACCCTGGGACTGT 
GGTTCCAAGGATGAGAGCAGGGGTTGGAGCCATGGCCTTCTGGGAACCTATGGAGAAAGGGA 
ATCCAAGGAAGCAGCCAAGGCTGCTCGCAGCTTCCCTGAGCTGCACCTCTTGCTAACCCCAC 
CATCACACTGCCACCCTGCCCTAGGGTCTCACTAGTACCAAGTGGGTCAGCACAGGGCTGAG 
GATGGGGCTCCTATCCACCCTGGCCAGCACCCAGCTTAGTGCTGGGACTAGCCCAGAAACTT 
GAATGGGACCCTGAGAGAGCCAGGGGTCCCCTGAGGCCCCCCTAGGGGCTTTCTGTCTGCCC 
CAGGGTGCTCCATGGATCTCCCTGTGGCAGCAGGCATGGAGAGTCAGGGCTGCCTTCATGGC 
AGTAGGCTCTAAGTGGGTGACTGGCCACAGGCCGAGAAAAGGGTACAGCCTCTAGGTGGGGT 
TCCCAAAGACGCCTTCAGGCTGGACTGAGCTGCTCTCCCACAGGGTTTCTGTGCAGCTGGAT 
TTTCTCTGTTGCATACATGCCTGGCATCTGTCTCCCCTTGTTCCTGAGTGGCCCCACATGGG 
GCTCTGAGCAGGCTGTATCTGGATTCTGGCAATAAAAGTACTCTGGATGCTGTAAAAAAAAA 
AAAAAAAAAAAAAA 




WO 99/46281 



PCT/US99/05G28 




></usr/seqdb2/sst/DNA/Dnaseqs .min/ss -DNA44189 
xsubunit 1 of 1, 412 aa, 1 stop 
><MW: 46658, pi: 6.65, NX(S/T): 4 

MGLHLRPYRVGLLPDGLLFLLLLmLLADPALPAGRHPPWLVPGDLGNQLE^LDKPTVTO 
YLCSKKTESYFTIWLNLELLLPVIIDCWIDNIRLVYNKTSRATQFPDGVDVRVPGFGKTFSL 
EFLDPSKSSVGSYFHT^ESLVGWGYTRGEDVRGAPYDWRRAPNENGPYFIALREMIEEmQ 
LYGGPVVLVAHSMGNMYTLYFLQRQPQAWKDKYIRA^ 

PVIGPLKIREQQRSAVSTSWLLPYNYTWSPEKVFVQTPTINYTLRDYRKFFQDIGFEDGWLM 
RQDTEGLVEATMPPGVQLHCLYGTGVPTPDSFYYESFPDRDPKICFGDGDGTVWLKSALQCQ 
AWQSRQEHQVLLQELPGSEHIEMLANATTLAYLKRVLLGP 

Important features: 
Signal peptide: 

amino acids 1-28 

Potential lipid stabs trate binding site: 
amino acids 147-164 

N-glycosylation sites. 

amino acids 99-102, 273-276, 289-292 and 398-401 

Lipases, serine proteins 

amino acids 189-201 

Beta-transducin family Trp-Asp repeat 

amino acids 353-365 



WO 99/46281 



PCT/US99/05028 



FIGURE 60 

CGGACGCGTGGGCGGACGCGTGGGGCGGCGGCAGCGGCGGCGACGGCGAC^ISGAGAGCGGG 
GCCTACGGCGCGGCCAAGGCGGGCGGCTCCTTCGACCTGCGGCGCTTCCTGACGCAGCCGCA 
GGTGGTGGCGCGCGCCGTGTGCTTGGTCTTCGCCTTGATCGTGTTCTCCTGCATCTATGGTG 
AGGGCTACAG C AATGC C CACGAGTCTAAGGAGATGTACTGCGTGTTCAACCGCAACGAGGAT 
GCCTGCCGCTATGGCAGTGCCATCGGGGTGCTGGCCTTCCTGGCCTCGGCCTTCTTCTTGGT 
GGTCGACGCGTATTTCCCCCAGATCAGCAACGCCACTGACCGCAAGTACCTGGTCATTGGTG 
ACCTGCTCTTCTCAGCTCTCTGGACCTTCCTGTGGTTTGTTGGTTTCTGCTTCCTCACCAAC 
CAGTGGGCAGTCACCAACCCGAAGGACGTGCTGGTGGGGGCCGACTCTGTGAGGGCAGCCAT 
CACCTTCAGCTTCTTTTCCATCTTCTCCTGGGGTGTGCTGGCCTCCCTGGCCTACCAGCGCT 
ACAAGGCTGGCGTGGACGACTTCATCCAGAATTACGTTGACCCCACTCCGGACCCCAACACT 
GCCTACGCCTCCTACCCAGGTGCATCTGTGGACAACTACCAACAGCCACCCTTCACCCAGAA 
CGCGGAGACCACCGAGGGCTACCAGCCGCCCCCTGTGTACJS^GTGGCGGTTAGCGTGGGAA 
GGGGGACAGAGAGGGCCCTCCCCTCTGCCCTGGACTTTCCCATCAGCCTCCTGGAACTGCCA 
GCCCCTCTCTTTCACCTGTTCCATCCTGTGCAGCTGACACACAGCTAAGGAGCCTCATAGCC 
TGGCGGGGGCTGGCAGAGCCACACCCCAAGTGCCTGTGCCCAGAGGGCTTCAGTCAGCCGCT 
CACTCCTCCAGGGCACTTTTAGGAAAGGGTTTTTAGCTAGTGTTTTTCCTCGCTTTTAATGA 
CCTCAGCCCCGCCTGCAGTGGCTAGAAGCCAGCAGGTGCCCATGTGCTACTGACAAGTGCCT 
CAGCTTCCCCCCGGCCCGGGTCAGGCCGTGGGAGCCGCTATTATCTGCGTTCTCTGCCAAAG 
ACTCGTGGGGGCCATCACACCTGCCCTGTGCAGCGGAGCCGGACCAGGCTCTTGTGTCCTCA 
CTCAGGTTTGCTTCCCCTGTGCCCACTGCTGTATGATCTGGGGGCCACCACCCTGTGCCGGT 
GGCCTCTGGGCTGCCTCCCGTGGTGTGAGGGCGGGGCTGGTGCTCATGGCACTTCCTCCTTG 
CTCCCACCCCTGGCAGCAGGGAAGGGCTTTGCCTGACAACACCCAGCTTTATGTAAATATTC 
TGCAGTTGTTACTTAGGAAGCCTGGGGAGGGCAGGGGTGCCCCATGGCTCCCAGACTCTGTC 

TGTGCCGAGTGTATTATAAAATCGTGGGGGAGATGCCCGGCCTGGGATGCTGTTTGGAGACG 
GAATAAATGTTTTCTCATTCAAAG 



WO 99/46281 



PCT/US99/0502S 



FIGURE 61 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA48304 
<subunit 1 of l, 224 aa, 1 stop 
<MW: 24810, pi:- 4.75, NX(S/T): 1 

MESGAYGAAKAGGS FDLRRFLTQPQ WARAVCLVFAL I VFS C I YGEGYSNAHESKQMYCVFN 

RNEDACRYGSAI G VL AFLAS AFFLWDAYFPQ I SNATDRKYL VI GDLLFS ALWTFLWFVGFC 

FLTNQWAVTNPKDVLVGADSVRAAIT^ 

DPNTAYAS YPGAS VDNYQQP PFTQNAETTEG YQP P PVY 

Important features r 

Type II Transmembrane domain: 

amino acids 24-43 

Other transmembrane domains: 

amino acids 74-90 , 108-126 and 145-161 

N-glycosylation site. 

amino acids 97-100 



WO 99/46281 



PCT/US99/05028 



FIGURE 62 

GAGCCACCTACCCTGCTCCGAGGCCAGGCCTGCAGGGCCTCATCGGCCAGAGGGTGATCAGT 
GAGCAGAAGG^ISCCCGTGGCCGAGGCCCCCCAGGTGGCTGGCGGGCAGGGGGACGGAGGTG 
ATGGCGAGGAAGCGGAGCCAGAGGGGATGTTCAAGGCCTGTGAGGACTCCAAGAGAAAAGCC 
CGGGGCTACCTCCGCCTGGTGCCCCTGTTTGTGCTGCTGGCCCTGCTCGTGCTGGCTTCGGC 
GGGGGTGCTACTCTGGTATTTCCTAGGGTACAAGGCGGAGGTGATGGTCAGCCAGGTGTACT 
CAGGCAGTCTGCGTGTACTCAATCGCCACTTCTCCCAGGATCTTACCCGCCGGGAATCTAGT 
GCCTT C CGCAGTGAAAC CGC CAAAG CCCAGAAGATGCTCAAGGAG CTCATCACCAGCACCCG 

CCTGGGAACTTACTACAACTCCAGCTCCGTCTATTCCTTTGGGGAGGGACCCCTCACCTGCT 
TCTTCTGGTTCATTCTCCAAATCCCCGAGCACCGCCGGCTGATGCTGAGCCCCGAGGTGGTG 
CAGGCACTGCTGGTGGAGGAGCTGCTGTCCACAGTCAACAGCTCGGCTGCCGTCCCCTACAG 
GGCCGAGTACGAAGTGGACCCCGAGGGCCTAGTGATCCTGGAAGCCAGTGTGAAAGACATAG 
CTGCATTGAATTCCACGCTGGGTTGTTACCGCTACAGCTACGTGGGCCAGGGCCAGGTCCTC 
CGGCTGAAGGGGCCTGACCACCTGGCCTCCAGCTGCCTGTGGCACCTGCAGGGCCCCAAGGA 
CCTCATGCTCAAACTCCGGCTGGAGTGGACGCTGGCAGAGTGCCGGGACCGACTGGCCATGT 
ATGACGTGGCCGGGCCCCTGGAGAAGAGGCTCATCACCTCGGTGTACGGCTGCAGCCGCCAG 
GAGCCCGTGGTGGAGGTTCTGGCGTCGGGGGCCATCATGGCGGTCGTCTGGAAGAAGGGCCT 
GCACAGCTACTACGACCCCTTCGTGCTCTCCGTGCAGCCGGTGGTCTTCCAGGCCTGTGAAG 
TGAACCTGACGCTGGACAACAGGCTCGACTCCCAGGGCGTCCTCAGCACCCCGTACTTCCCC 
AGCTACTACTCGCCCCAAACCCACTGCTCCTGGCACCTCACGGTGCCCTCTCTGGACTACGG 
CTTGGCCCTCTGGTTTGATGCCTATGCACTGAGGAGGCAGAAGTATGATTTGCCGTGCACCC 
AGGGC CAGTGGACGATC CAGAACAGGAGGCTGTGTGGCTTGCGCATCCTGCAGCCCTACGCC 
GAGAGGATCCCCGTGGTGGCCACGGCCGGGATCACCATCAACTTCACCTCCCAGATCTCCCT 
CACCGGGCCCGGTGTGCGGGTGCACTATGGCTTGTACAACCAGTCGGACCCCTGCCCTGGAG 
AGTTCCTCTGTTCTGTGAATGGACTCTGTGTCCCTGCCTGTGATGGGGTCAAGGACTGCCCC 
AACGGCCTGGATGAGAGAAACTGCGTTTGCAGAGCCACATTCCAGTGCAAAGAGGACAGCAC 
ATGCATCTCACTGCCCAAGGTCTGTGATGGGCAGCCTGATTGTCTCAACGGCAGCGATGAAG 
AGCAGTGCCAGGAAGGGGTGCCATGTGGGACATTCACCTTCCAGTGTGAGGACCGGAGCTGC 
GTGAAGAAGCCCAACCCGCAGTGTGATGGGCGGCCCGACTGCAGGGACGGCTCGGATGAGGA 
GCACTGTGACTGTGGCCTCCAGGGCCCCTCCAGCCGCATTGTTGGTGGAGCTGTGTCCTCCG 
AGGGTGAGTGGCCATGGCAGGCCAGCCTCCAGGTTCGGGGTCGACACATCTGTGGGGGGGCC 
CTCATCGCTGACCGCTGGGTGATAACAGCTGCCCACTGCTTCCAGGAGGACAGCATGGCCTC 
CACGGTGCTGTGGACCGTGTTCCTGGGCAAGGTGTGGCAGAACTCGCGCTGGCCTGGAGAGG 
TGTC CTT CAAGGTGAGCCG C CTGCTCCTGCACC CGTAC CACGAAGAGGACAGCCATGACTAC 

GACGTGGCGCTGCTGCAGCTCGACCACCCGGTGGTGCGCTCGGCCGCCGTGCGCCCCGTCTG 
CCTGCCCGCGCGCTCCCACTTCTTCGAGCCCGGCCTGCACTGCTGGATTACGGGCTGGGGCG 
CCTTGCGCGAGGGCGGCCCCATCAGCAACGCTCTGCAGAAAGTGGATGTGCAGTTGATCCCA 
CAGGACCTGTGCAGCGAGGCCTATCGCTACCAGGTGACGCCACGCATGCTGTGTGCCGGCTA 
CCGCAAGGGCAAGAAGGATGCCTGTCAGGGTGACTCAGGTGGTCCGCTGGTGTGCAAGGCAC 
TCAGTGGCCGCTGGTTCCTGGCGGGGCTGGTCAGCTGGGGCCTGGGCTGTGGCCGGCCTAAC 
TACTTCGGCGTCTACACCCGCATCACAGGTGTGATCAGCTGGATCCAGCAAGTGGTGACCIS 
&GGAACTGCCCCCCTG CAAAG CAGGGCCCACCTCCTGGACTCAGAGAGCCCAGGGCAACTGC 
CAAG CAGGGGGACAAGTATT CTGGCGGGGGGTGGGGGAGAGAGCAGGC CCTGTGGTGGCAGG 
AGGTGGCATCTTGTCTCGTCCCTGATGTCTGCTCCAGTGATGGCAGGAGGATGGAGAAGTGC 
CAGCAGCTGGGGGTCAAGACGTCCCCTGAGGACCCAGGCCCACACCCAGCCCTTCTGCCTCC 
CAATTCTCTCTCCTCCGTCCCCTTCCTCCACTGCTGCCTAATGCAAGGCAGTGGCTCAGCAG 
CAAGAATGCTGGTTCTACATCCCGAGGAGTGTCTGAGGTGCGCCCCACTCTGTACAGAGGCT 
GTTTGGGCAGCCTTGCCTCCAGAGAGCAGATTCCAGCTTCGGAAGCCCCTGGTCTAACTTGG 
GATCTGGGAATGGAAGGTGCTCCCATCGGAGGGGACCCTCAGAGCCCTGGAGACTGCCAGGT 
GGGCCTGCTGCCACTGTAAGCCAAAAGGTGGGGAAGTCCTGACTCCAGGGTCCTTGCCCCAC 
CCCTGCCTGCCACCTGGGCCCTCACAGCCCAGACCCTCACTGGGAGGTGAGCTCAGCTGCCC 
TTTGGAATAA^GCTGCCTGATCAAAAAAAAAAAAAAAAAAAAA 



WO 99/46281 



PCT/US99/05028 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA49152 
xsubunit 1 of 1, 802 aa, 1 stop ' 
XMW: 88846, pi: 6.41, NX(S/T): 7 

MPVAEAPQVAGGQGDGGDGEEAEPEGMFKACEDSKRKARGYLRLVPLFVLL^ 

L WYFLGYKAEVMVS Q VY S GSLRVLNRHFS QDLTRRES SAFRSE TAKAQKMLKEL I TS TRLGT 

YYNSSSVYSFGEGPLTCFFWFILQIPEHRRLMLSPEWQALLVEELLSTVNSSAAVPYRAEY 

EVDPEGLVILEASVKDIAALNSTLGCYRYSYVGQGQVLRLKGPDHLASSCLWHLQGPKDLML 

KLRLEWTIAECRDRIJUYrYDVAGPLEKRLI^^ 

YDPFVLSVQPWFQACEWLTLDNRLDSQGVLSTPYFPSYYSPQTHCSWHLTVPSLDYGIiAL 
WFDAYALRRQKYDLPCTQGQWTIQNRRLCGLRILQPYAERIPWATAGITINFTSQISLTGP 
GVRVHYGL YNQ SDPCPGE FLCS VNGLCVPACDGVKDCPNGLDERNCVCRAT FQCKEDS TC I S 
LPKVCDGQPDCLNGSDEEQCQEGVPCGTFTFQCEDRSCVKKPNPQCDGRPDCRDGSDEEHCD 
CGLQGPSSRIVGGAVSSEGEWPWQASLQYRGRHICGGALIADRWITAAHCFQEDSMASTVL 
WTVFLGKVWQNSRWPGEVSFKVSRLLLHPYHEEDSHDYDVALLQLDHPWRSAAVRPVCLPA 
RSHFFEPGLHCWITGWGALREGGPISNALQKYDVQLIPQDLCSEAYRYQVTPRMLCAGYRKG 
KKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNYFGVYTRITGVISWIQQWT 

Important features: 

Type 1 1 transmembrane domain : 

amino acids 4 6-67 

Serine proteases, trypsin family, histidine active site* 
amino acids 604-609 

N-glycosylation sites. 

amino acids 127-130, 175-178, 207-210, 329-332, 424-427, 444-447 
and 509-512 

Krxngle domains , 

amino acids 746-758 and 592-609 

Homologous region to Kallikrein Light Chain: 
amino acids 568-779 

Homologous region to Low-density lipoprotein receptor: 
amino acids 4 51-567 



WO 99/46281 PCT/US99/05028 

FIGU R E 6 4 

GCACCCAGGGCCAGTGGACGATCCAGAACAGGAGGCTGTGTGGCTTGCGCATCCTGCAGCCC 
.TACGCCGAGAGGATCCCCGTGGTGGCCACGGCCGGGATCACCATCAACTTCACCTCCCAGAT 
CTCCCTCACCGGGCCCGGTGTGCGGGTGCACTATGGCTTGTACAACCAGTCGGACCCCTGCC 
CTGGAGAGTTCCTCTGTTCTGTGAATGGACTCTGTGTCCCTGCCTGTGATGGGGTCAAGGAC 
TGCCCCAACGGCCTGGATGAGAGAAACTGCGTTTGCAGAGCCACATTCCAGTGCAAAGAGGA 
CAGCACATGCATCTCACTGCCCAAGGTCTGTGATGGGCAGCCTGATTGTCTCAACGGCAGCG 
ATGAAGAGCAGTGCCAGGAAGGGGTGCCATGTGGGACATTCACCTTCCAGTGTGAGGACCGG 
AGCTGCGTGAAGAAGCCCAACCCGCAGTGTGATGGGCGGCCCGACTGCAGGGACGGCTCGGA 
TGAGGAGCACTGTGACTGTGGCCTCCAGGGCCCCTCCAGCCGCATTGTTGGTGGAGCTGTGT 
CCTCCGAGGGTGAGTGGCCATGGCAGGCCAGCCTCCAGGTTCGGGGTCGACACATCTGTGGG 
GGGGCCCTCATCGCTGACCGCTGGGTGATAACAGCTGCCCACTGCTTCCAGGAGGACAGCAT 
GGCCTCCACGGTGCTGTGGACCGTGTTCCTGG.GCAAGGTGTGGCAGAACTCGCGCTGGCCTG 
GAGAGGTGTCCTTCAAGGTGAGCCGCCTGCTCCTGCACCCGTACCACGAAGAGGACAGCCAT 
GACTACGACGTGGCGCTGCTGCAGCTCGACCACCCGGTGGTGCGCTCGGCCGCCGTGCGCCC 
CGTCTGCCTGCCCGCGCGCTCCCACTTCTTCGAGCCCGGCCTGCACTGCTGGATTACGGGCT 
GGGGCGCCTTGCGCGAGGGCGGCCCCATCAGCAACGCTCTGCAGAA?VGTGGATGTGCAGTTG 
ATCCCACAGGACCTGTGCAGCGAGGCCTATCGCTACCAGGTGACGCCACGCATGCTGTGTGC 
CGGCTACCGCAAGGGCAAGAAGGATGCCTGTCAGGGTGACTCAGGTGGTCCGCTGGTGTGCA 
AGGCACTCAGTGGCCGCTGGTTCCTGGCGGGGCTGGTCAGCTGGGGCCTGGGCTGTGGCCGG 
CCTAACTACTTCGGCGTCTACACCCGCATCACAGGTGTGATCAGCTGGATCCAGCAAGTGGT 
GACCTGAGGAACTGCCCCCCTGCAAAGCAGGGCCCACCTCCTGGACTCAGAGAGCCCAGGGC 
AACTGC CAAGCAGGGGGACAAGTAT 
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FIGURE 65 

GGACGAGGGCAGATCTCGTTCTGGGGCAAGCCGTTGACACTCGCTCCCTGCCACCGCCCGGG 

CTCCGTGCCGCCAAGTTTTCATTTTCCACCTTCTCTGCCTCCAGTCCCCCAGCCCCTGGCCG 

AGAGAAGGGTCTTACCGGCCGGGATTGCTGGAAACACCAAGAGGTGGTTTTTGTTTTTTAAA 

ACTTCTGTTTCTTGGGAGGGGGTGTGGCGGGGCAGGAESAGCAACTCCGTTCCTCTGCTCTG 

TTTCTGGAGCCTCTGCTATTGCTTTGCTGCGGGGAGCCCCGTACCTTTTGGTCCAGAGGGAC 

GGCTGGAAGATAAGCTCCACAAACCCAAAGCTACACAGACTGAGGTCAAACCATCTGTGAGG 

TTTAACCTCCGCACCTCCAAGGACCCAGAGCATGAAGGATGCTACCTCTCCGTCGGCCACAG 
C C AG C C C T T AG AAG AC T G C AG T T T C AAC AT G AC AG C T AAAAC C T T T T T CAT CAT T C AC G GAT 

GGACGATGAGCGGTATCTTTGAAAACTGGCTGCACAAACTCGTGTCAGCCCTGCACACAAGA 
GAG AAAGAC GC C AAT G TAG T TG T G G T TGAC T G GCT C C C CCT GGCCCAC CAGC T T TAC AC GGA 

TGCGGTCAATAATACCAGGGTGGTGGGACACAGCATTGCCAGGATGCTCGACTGGCTGCAGG 
AGAAGGACGATTTTTCTCTCGGGAATGTCCACTTGATCGGCTACAGCCTCGGAGCGCACGTG 
GCCGGGTATGCAGGCAACTTCGTGAAAGGAACGGTGGGCCGAATCACAGGTTTGGATCCTGC 
C G GGC C CAT G T T T G AAG G GGC C GAC ATC CACAAGAGGC T C TC T CC GGACGAT GC AGAT T T T G 

TGGATGTCCTCCACACCTACACGCGTTCCTTCGGCTTGAGCATTGGTATTCAGATGCCTGTG 
GGCCACATTGACATCTACCCCAATGGGGGTGACTTCCAGCCAGGCTGTGGACTCAACGATGT 
C T TGGGAT C AAT T G C ATAT G GAACAAT CACAGAG G T G G T AAAATG T GAGC AT GAGC GAGC CG 

TCCACCTCTTTGTTGACTCTCTGGTGAATCAGGACAAGCCGAGTTTTGCCTTCCAGTGCACT 
GACTCCAATCGCTTCAAAAAGGGGATCTGTCTGAGCTGCCGCAAGAACCGTTGTAATAGCAT 
TGGCTACAATGCCAAGAAT^ATGAGGAACAAGAGGAACAGCAAAATGTACCTAAAAACCCGGG 
CAGGCATGCCTTTCAGAGGTAACCTTCAGTCCCTGGAGTGTCCCXS^GGAAGGCCCTTAATA 
CCTCCTTCTTAATACCATGCTGCAGAGCAGGGCACATCCTAGCCCAGGAGAAGTGGCCAGCA 
CAAT C C AAT C AAAT C G T T G C AAAT CAG AT TAC AC T G T G CAT G T C C TAGGAAAG GG AAT C T T T 
AC AAAAT AAAC AG TGTGGACCCC T AATAAAAAAAAAAAAAAAkAA&AAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 66 

>< /us r / seqdh2 /ss t /DNA/Dnaseqs . min/ s s . DNA4 9646 
xsubunit 1 of 1, 354 aa, 1 stop 
><MW: 39362, pi: 8.35, NX(S/T): 2 

MSNSVPLLCFWSLCYCFAAGSPVPFGPEGRLEDKLHKPKATQTEVKPSVRFliTLRTSKDPEHE 
GCYLS VGHSQPLEDCS FNMTAKTFF I IHGWTMSG I FE3STWIiHKLVS ALHTREKDANVVVVDWL 
PLAHQL YTDAVNNTRVVGHS I ARMLDWLQEKDDFSLGNVHLI GYSLGAHVAGYAGNFVKGTV 
GRITGLDPAGPMFEGADIHKRLSPDDADFVDVLHTYTRSFGLSIGIQMPVGHIDIYPNGGDF 
QPGCGLNDVLGSIAYGTITEWKCEHERAVHLFVDSLVHQDKPSFAFQCTDSNRFKKGICLiS 
CRKNRCNSIGYNAKKMRNKRNSKMYLKTRAGMPFRGNLQSLECP 

Important features : 
Signal peptide: 

amino acids 1-16 

Lipases, serine active site. 

amino acids 163-172 

N-glycosylation sites, 

amino acids 80-83 and 136-139 
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FIGURE 67 



CGGACGCGTGGGCGGACGCGTGGGCCTGGGCAAGGGCCGGGGCGCCGGGCCGAGCCACCTCT 
TCCCCTCCCCCGCTTCCCTGTCGCGCTCCGCTGGCTGGACGCGCTGGAGGAGTGGAGCAGCA 
CCCGGCCGGCCCTGGGGGCTGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAA 
CCCCGGCAGGTGGCGACCAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGC 
GAGGGCGCGCCCCAGTGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGC 
GGCGGCGGACGGAGAAAACAACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTG 
CCGCCGCCTCCCCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGC 
CCCCCTGGGGAGGCGGCACCAGGGAGCCTGGGCGCCCGGGGCTCCGCCGCGACCCCATCGGG 
TAGACCACAGAAGCTCCGGGACCCTTCCGGCACCTCTGGACAGCCCAGGaiGCTGTTGGCCA 
CCCTCCTCCTCCTCCTCCTTGGAGGCGCTCTGGCCCATCCAGACCGGATTATTTTTCCAAAT 
CAT G C T T G T G AGG AC C C C C C AGC AG T G C T C T TAG AAG T GCAGGGCAC C T T AC AGAGGC C C C T 

GGTCCGGGACAGCCGCACCTCCCCTGCCAACTGCACCTGGCTCATCCTGGGCAGCAAGGAAC 
AG AC T G T C AC CAT C AG G T T C CAGAAGC T ACACC T GG CC T G T GGC TC AGAGC G C T TAAC CC TA 

CGCTCCCCTCTCCAGCCACTGATCTCCCTGTGTGAGGCACCTCCCAGCCCTCTGCAGCTGCC 
CGGGGGCAACGTCACCATCACTTACAGCTATGCTGGGGCCAGAGCACCCATGGGCCAGGGCT 
TCCTGCTCTCCTACAGCCAAGATTGGCTGATGTGCCTGCAGGAAGAGTTTCAGTGCCTGAAC 
CACCGCTGTGTATCTGCTGTCCAGCGCTGTGATGGGGTTGATGCCTGTGGCGATGGCTCTGA 
TGAAGCAGGTTGCAGCTCAGACCCCTTCCCTGGCCTGACCCCAAGACCCGTCCCCTCCCTGC 
CTTGCAATGTCACCTTGGAGGACTTCTATGGGGTCTTCTCCTCTCCTGGATATACACACCTA 
GCCTCAGTCTCCCACCCCCAGTCCTGCCATTGGCTGCTGGACCCCCATGATGGCCGGCGGCT 
GGCCGTGCGCTTCACAGCCCTGGACTTGGGCTTTGGAGATGCAGTGCATGTGTATGACGGCC 
CTGGGCCCCCTGAGAGCTCCCGACTACTGCGTAGTCTCACCCACTTCAGCAATGGCAAGGCT 
GTCACTGTGGAGACACTGTCTGGCCAGGCTGTTGTGTCCTACCACACAGTTGCTTGGAGCAA 
TGGTCGTGGCTTCAATGCCACCTACCATGTGCGGGGCTATTGCTTGCCTTGGGACAGACCCT 
GTGGCTTAGGCTCTGGCCTGGGAGCTGGCGAAGGCCTAGGTGAGCGCTGCTACAGTGAGGCA 
CAGCGCTGTGACGGCTCATGGGACTGTGCTGACGGCACAGATGAGGAGGACTGCCCAGGCTG 
CCCACCTGGACACTTCCCCTGTGGGGCTGCTGGCACCTCTGGTGCCACAGCCTGCTACCTGC 
CTGCTGACCGCTGCAACTACCAGACTTTCTGTGCTGATGGAGCAGATGAGAGACGCTGTCGG 
CAT T G C C AGC C T GG C AAT T T CC GATG C CGGGAC GAGAAG T GC G T G TAT GAGACG T G GG T G T G 

CGATGGGCAGCCAGACTGTGCGGACGGCAGTGATGAGTGGGACTGCTCCTATGTTCTGCCCC 
GCAAGGTCATTACAGCTGCAGTCATTGGCAGCCTAGTGTGCGGCCTGCTCCTGGTCATCGCC 
CTGGGCTGCACCTGCAAGCTCTATGCCATTCGCACCCAGGAGTACAGCATCTTTGCCCCCCT 
CTCCCGGATGGAGGCTGAGATTGTGCAGCAGCAGGCACCCCCTTCCTACGGGCAGCTCATTG 
CCCAGGGTGCCATCCCACCTGTAGAAGACTTTCCTACAGAGAATCCTAATGATAACTCAGTG 
CTGGGCAACCTGCGTTCTCTGCTACAGATCTTACGCCAGGATATGACTCCAGGAGGTGGCCC 
AGGTGCCCGCCGTCGTCAGCGGGGCCGCTTGATGCGACGCCTGGTACGCCGTCTCCGCCGCT 
GGGGCTTGCTCCCTCGAACCAACACCCCGGCTCGGGCCTCTGAGGCCAGATCCCAGGTCACA 
CCTTCTGCTGCTCCCCTTGAGGCCCTAGATGGTGGCACAGGTCCAGCCCGTGAGGGCGGGGC 
AGTGGGTGGGCAAGATGGGGAGCAGGCACCCCCACTGCCCATCAAGGCTCCCCTCCCATCTG 
CTAGCACGTCTCCAGCCCCCACTACTGTCCCTGAAGCCCCAGGGCCACTGCCCTCACTGCCC 
CTAGAGCCATCACTATTGTCTGGAGTGGTGCAGGCCCTGCGAGGCCGCCTGTTGCCCAGCCT 
GGGGCCCCCAGGACCAACCCGGAGCCCCCCTGGACCCCACACAGCAGTCCTGGCCCTGGAAG 
ATGAGGACGATGTGCTACTGGTGCCACTGGCTGAGCCGGGGGTGTGGGTAGCTGAGGCAGAG 
GATGAGCCACTGCTTACCTG^GGGGACCTGGGGGCTCTACTGAGGCCTCTCCCCTGGGGGCT 
CTACTCATAGTGGCACAACCTTTTAGAGGTGGGTCAGCCTCCCCTCCACCACTTCCTTCCCT 
GTCCCTGGATTTCAGGGACTTGGTGGGCCTCCCGTTGACCCTATGTAGCTGCTATAAAGTTA 
AGTGTCCCTCAGGCAGGGAGAGGGCTCACAGAGTCTCCTCTGTACGTGGCCATGGCCAGACA 
CCCCAGTCCCTTCACCACCACCTGCTCCCCACGCCACCACCP.TTTGGGTGGCTGTTT^TAAA 
AAG T AAAG T T C T TAG AG GAT CATAGG T C T GG ACAC T CCAT C C T T GC CAAACC T C T AC C CAA& 
AG T GG C C T T AAGC AC C G G AAT GC CAAT TAAC T AGAGAC C C T C C AGC C C C CAAGGGG AG GAT T 
T GGG C AG AAC C T GAG G T T T T GCCAT C C AC AAT C C C T CCT AC AG GGCCTGGCT CACAAAAAGA 
G T GC AAC AAAT G C T T C TAT T CCAT AG C T AC G GC AT T G C T C AG T AAG T T G AGG T C A&AAAT AA 
AGGAATCATACATCTC 
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FIGURE 68 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA49631 
<subunit 1 of 1, 713 aa, 1 stop 
<MW: 76193, pi: 5.42, NX(S/T): 4 

MLLATLLLLLLGGALAHPDRIIFPNHACEDPPAVLLEVQGTLQRPLVRDSRTSPANCTWLIL 
GSKEQTVTIRFQKLHLACGSERLTLRSPLQPLISLCEAPPSPLQLPGGNVTITYSYAGARAP 
MGQGFLLSYSQDWLMCLQEEFQCLNHRCVSAVQRCDGVDACGDGSDEAGCSSDPFPGLTPRP 
VPSLPCNVTLEDFYGVFSSPGYTKIASVSHPQSCHWLLDPHDGRRLAVRFTALDLGFGDAVH 
VYDGPGPPESSRLLRSLTHFSNGKAVTVETLSGQAWSYHTVAWSNGRGFNATYHVRGYCLP 
WDRPCGLGSGLGAGEGLGERCYSEAQRCDGSWDCADGTDEEDCPGCPPGHFPCGAAGTSGAT 
ACYLPADRCNYQTFCADGADERRCRHCQPGNFRCRDEKCVYETWVCDGQPDCADGSDEWDCS 
YVLPRKVI TAAVI GS LVCGLLLVIALGCTCKLYAIRTQE YS I FAPLSRMEAE IVQQQAP PSY 

GQLIAQGAIPPVEDFPTENPNDNSVLGNLRSLLQILRQDMTPGGGPGARRRQRGRLMRRLVR 
RLRRWGLLPRTNTPARASEARSQVTPSAAPLEALDGGTGPAREGGAVGGQDGEQAPPLPIKA 
PLPSASTSPAPTTVPEAPGPLPSLPLEPSLLSGWQALRGRLLPSLGPPGPTRSPPGPHTAV 
J^ALEDEDDVLLVPLAEPGVWVAEAEDEPLLT 

Important features : 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 442-462 

LDL-receptor class A (LDLRA) domain proteins 

amino acids 411-431, 152-171, 331-350 and 374-393 
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e 




CGAGCTGGGCGAGAAGTAGGGGAGGGCGGTGCTCCGCCGCGGTGGCGGTTGCTATCGCTTCG 
CAGAACCTACTCAGGCAGCCAGCTGAGAAGAGTTGAGGGAAAGTGCTGCTGCTGGGTCTGCA 
GACGCGAISGATAACGTGCAGCCGAAAATAAAACATCGCCCCTTCTGCTTCAGTGTGAAAGG 
CCACGTGAAGATGCTGCGGCTGGCACTAACTGTGACATCTATGACCTTTTTTATCATCGCAC 
AAGCCCCTGAACCATATATTGTTATCACTGGATTTGAAGTCACCGTTATCTTATTTTTCATA 
C T T T T AT ATG T AC T C AGAC T TGAT C GAT TAAT GAAGTGG T TAT TTTGGCCTTTGCTT GAT AT 
TAT C AAC T CAC T G G T AAC AAC AGT AT TC AT GC T CAT CGT AT CT G TG T T GGCAC TGATACCAG 
AAACCACAACATTGACAGTTGGTGGAGGGGTGTTTGCACTTGTGACAGCAGTATGCTGTCTT 
GCCGACGGGGCCCTTATTTACCGGAAGCTTCTGTTCAATCCCAGCGGTCCTTACCAGAAAAA 
GCCTGTGCATGAAAAAAAAGAAGTTTTG^^TTTTATATTACTTTTTAGTTTGATACTAAGT 
ATTAAACATATTTCTGTATTCTTCCAAAAAAAA&AAAAAAAAA 
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FIGURE 70 



X/usr/seqdb2/sst/DNA/Dnaseqs*min/ss,DNA49645 
Xsubunit 1 of 1, 152 aa, 1 stop 
><MW: 17170, pi: 9.62, NX{S/T): 1 

MDNVQPKIKHRPFCFSVKGHVKMLRI^TVTSHTFFIIAQAPEPYIVITGFEVTVILFFILL 

YVLRLDRMKWLFWPLLDIINSLVTTVmLIVSVLALIPETTTLTVGGGVFALVTAVCCLAD 
GALIYRKLL FNPS GPYQKKPVHEKKEVL 

Important features : 

Potential type II transmembrane domain: 

amino acids 2 6-42 

Other potential transmembrane domain: 

amino acids 44-65, 81-101 and 109-129 

Leucine zipper pattern 

amino acids 78-99 and 85-106 

N-myristoylation site. 

amino acids 110-115 

Ribonucleotide reductase large subunit protein 

amino acids 116-127 
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FIGURE 71 



GGGCGAGAAGTAGGGGAGGGCGTGTTCCGCCGCGGTGGCGGTTGCTATCGTTTTGCAGAACC 
TACTCAGGCAGCCAGNTGAGAAGAGTTGAGGGAAAGTGCTGCTGCTGGGTCTGCAGACGCGA 
TGGATAACGTGCAGCCGAAAATAAAACATCGCCCCTTCTGCTTCAGTGTGAAAGGCCACGTG 
AAGATG CTGCGGCTGGC ACTAACTGNGACATCTATGACCTTTTTTATNATCGCACAAG CCCC 
TGAACCATATATTGTTATCACTGGATTTGAAGTCACCGTTATCTTATTTTTCATACTTTTAT 
ATGTACTCAGACTTGATCGATTAATGAAGTGGTTATTTTGGCCTTTGCTTGATATTATCAAC 
TCACTGGTAACAACAGTATTCATGCTCATCGTATCTGTGTTGGC^CTGATACCAGAAACCAC 
AACATTGACAGTTGGTGGAGGGGTGTTTGCACTTGTGACAGCAGTATGCTGTNTTGCCGAC 





WO 99/46281 



PCT/US99/05028 



FIG U RE 72 



CAGCCCCGCGCGCCGGCCGAGTCGCTGAGCCGCGGCTGCCGGACGGGACGGGACCGGCTAGG 
CTGGGCGCGCCCCCCGGGCCCCGCCGTGGGCAISGGCGCACTGGCCCGGGCGCTGCTGCTGC 
CTCTGCTGGCCCAGTGGCTCCTGCGCGCCGCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTG 
CCCCTCCGGGTGGCCGCGGCCACGAACCGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCC 
TGCCGAGCGCCACGCCGACGGCTTGGCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGG 
GCGCCGCCAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTAC 
C T GGAGAT G C T GAT C G G G AC C C CCCCGCAG AAGC T ACAGAT T C T C G T T GAC AC T GGAAGCAG 
TAACT T TGCCG T GGCAGGAACCCCGCAC TCC TACATAGACACG TAC TTTGACACAGAGAGG T 
C TAGC AC ATAC C GC T CCAAGGGCTTTGACG TCACAG T GAAG TACACACAAGGAAGCTGGACG 
GGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAA 
CATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATAC 
TTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCC 
CTGGTGACACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCC 
CGTTGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGT 
ATAAAG GAG AC AT C T G G TAT AC CCC TAT T AAGGAAG AG T G G TAC TACCAGAT AGAAAT T C T G 
AAAT TGGAAAT T GGAGGC CAAAGCCTTAATCT GGAC TGC AGAGAGTATAACGCAGACAAGGC 
CATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTGG 
AAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCCCAG 
CTGGCG TGC T GGAC GAATTCGGAAAGACCTTGGTCT TAC TTCCCTAAAATCTCCATCTACCT 
GAGAGAC GAGAAC T C C AGCAGG TCATTCCGT AT CACAAT C CTGCCTCAGC T TTACAT T CAGC 
CCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCATCCACAAAT 
GCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGACAGAGCCCAGAA 
GAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCAGTGTCTGAAATTT 
CCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCTCAGTCTTTGAGCGAG 
CCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGAGCCATCCTCCTTGTCTT 
AATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCA 
AT GAT GAG TCCTCTCTGGT CAGACAT CGCT GG AAATG&AT AGCC AGGCC T GAC C T C AAGC AA 
CCAT GAAC T C AG C TAT T AAG AAAAT C AC AT T T C C AG GG CAG C AGCC GGGAT C GAT G G T G GC G 
CTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCTGCTCCCAGATGCCTTCTAGATTCAC 

TGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATCCTCCCTACTTCCAAGAAAAATAATTAA 
AAAAAAAACTTCATTCTAA 
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></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45493 
Xsubunit 1 of 1, 518 aa r 1 stop 
XMW: 56180, pi: 5.08, NX(S/T); 2 
MGAIJIRALLLPLLAQWLLRAAPEIAPAP 

ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHS 
YIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENF 
FLPG I KWNG I LGLAYAT LAKPS S S LE T FFDS LVTQANI PNVFSMQMCGAGLPVAGS GTNGGS 
LVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLR 
L P QKVFDAWE AVARAS L I PE FS DG FWTG S QLAC WTNS E T PW S YFPKI S I YLRDENS S RS FR 
I TILPQLY I QPMMGAGLNYECYRFGI SPSTNALVIGATVMEGFYVI FDRAQKRVGFAAS PCA 

EIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRC 
QRRPRDPE WNDE S S LVRHRWK 

Important features: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 466-494 

N-glyaosylation sites. 

amino acids 170-173 and 366-369 

Leucine zipper pattern. 

amino acids 10-31 and 197-118 

Eukaryotic and viral aspartyl proteases 

amino acids 109-118, 252-261 and 298-310 
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FIGURE 74 

CGCCTCCGCCTTCGGAGGCTGACGCGCCCGGGCGCCGTTCCAGGCCTGTGCAGGGCGGATCG 
GCAGCCGCCTGGCGGCGATCCAGGGCGGTGCGGGGCCTGGGCGGGAGCCGGGAGGCGCGGCC 
GGCAISGAGGCGCTGCTGCTGGGCGCGGGGTTGCTGCTGGGCGCTTACGTGCTTGTCTACTA 
CAACCTGGTGAAGGCCCCGCCGTGCGGGGGCATGGGCAACCTGCGGGGCCGCACGGCCGTGG 
TCACGGGCGCCAACAGCGGCATCGGAAAGATGACGGCGCTGGAGCTGGCGCGCCGGGGAGCG 
CGCGTGGTGCTGGCCTGCCGCAGCCAGGAGCGCGGGGAGGCGGCTGCCTTCGACCTCCGCCA 
GGAGAGTGGGAACAATGAGGTCATCTTCATGGCCTTGGACTTGGCCAGTCTGGCCTCGGTGC 
GGGCCTTTGCCACTGCCTTTCTGAGCTCTGAGCCACGGTTGGACATCCTCATCCACAATGCC 
GGTATCAGTTCCTGTGGCCGGACCCGTGAGGCGTTTAACCTGCTGCTTCGGGTGAACCATAT 
CGGTCCCTTTCTGCTGACACATCTGCTGCTGCCTTGCCTGAAGGCATGTGCCCCTAGCCGCG 
TGGTGGTGGTAGCCTCAGCTGCCCACTGTCGGGGACGTCTTGACTTCAAACGCCTGGACCGC 
CCAGTGGTGGGCTGGCGGCAGGAGCTGCGGGCATATGCTGACACTAAGCTGGCTAATGTACT 
GTTTGCCCGGGAGCTCGCCAACCAGCTTGAGGCCACTGGCGTCACCTGCTATGCAGCCCACC 
CAGGGCCTGTGAACTCGGAGCTGTTCCTGCGCCATGTTCCTGGATGGCTGCGCCCACTTTTG 
CGCCCATTGGCTTGGCTGGTGCTCCGGGCACCAAGAGGGGGTGCCCAGACACCCCTGTATTG 
TGCTCTACAAGAGGGCATCGAGCCCCTCAGTGGGAGATATTTTGCCAACTGCCATGTGGAAG 
AG GUGCCTC C AGC T G C C C G AGAC GAC C GGG CAGC CCAT C GG C T AT GGG AGGCC AG C AAGAGG 
C T GG CAG GG C TTGGGCCTG GG GAGGAT GC T GAACCCGAT GAAGAC C C C CAG TC T G AGGAC TC 
AGAGGCCCCATCTTCTCTAAGCACCCCCCACCCTGAGGAGCCCACAGTTTCTCAACCTTACC 
C CAG C C C T C AGAG C T C AC C AGAT T T G T C T AAG AT GAC G CAC C GAAT T CAGGC T AAAG T T GAG 
CCTGAGATCCAGCTCTCCI^ACCCTCAGGCCAGGATGCTTGCCATGGCACTTCATGGTCCTT 
GAAAACCTCGGATGTGTGTGAGGCCATGCCCTGGACACTGACGGGTTTGTGATCTTGACCTC 
CGTGGTTACTTTCTGGGGCCCCAAGCTGTGCCCTGGACATCTCTTTTCCTGGTTGAAGGAAT 
AATGGG T GATTATTTC T T CCTGAGAGTGACAGTAACCCCAGATGGAGAGATAGGGGTATGCT 
AGACACTGTGCTTCTCGGAAATTTGGATGTAGTATTTTCAGGCCCCACCCTTATTGATTCTG 
AT CAGC T C T G GAG CAG AG G C AGGGAG T T T GCAAT G TGAT GC AC T GC CAAC AT T GAGAAT TAG 
TGAACTGATCCCTTTGCAACCGTCTAGCTAGGTAGTTAAATTACCCCCATGTTAATGAAGCG 
GAATTAGGCTCCCGAGCTAAGGGACTCGCCTAGGGTCTCACAGTGAGTAGGAGGAGGGCCTG 
GGATCTGAACCCAAGGGTCTGAGGCCAGGGCCGACTGCCGTAAGATGGGTGCTGAGAAGTGA 
GTCAGGGCAGGGCAGCTGGTATCGAGGTGCCCCATGGGAGTAAGGGGACGCCTTCCGGGCGG 
ATGCAGGGCTGGGGTCATCTGTATCTGAAGCCCCTCGGAATAAAGCGCGTTGACCGCCAAAA 
AAAAAAAAAAAAAAAAA 
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</usr/5eqdb2/sst/DNA/Dnaseqs.mirx/ss -DNA48227 
<subunit 1 of 1, 377 aa, 1 stop 
<MW: 40849, pi: 7.98, NX(S/T): 0 
MEALLLGAGLLLGAYVLVYYNLVKAPPCGG 

WLACRSQERGEAAAFDLRQESGNNEVIFMALDIAS1A.SVKAFATAFLSSEPR 

ISSCGRTREAFNLLLRVNHIGPFLLTHLLLPCLKACAPSRWWASAAHCRGRLDFKRLDRP 
WGWRQELRAYADTKLAWLFARELANQLEATGVTCYAAHPGPVNSELFLRHVPGWLR 

PLAWLVLRAPRGGAQTPLYCALQEGIEPLSGRYFANCHVEEVPPAARDDRAAHRLWEASKRL 

AGLGPGEDAEPDEDPQSEDSEAPSSLSTPHPEEPTVSQPYPSPQSSPDLSKMTHRIQAKVEP 
EIQLS 

Important features: 
Signal peptide: 

amino acids 1-16 



Giycosaminoglycan attachment site. 

amino acids 4 6-4 9 



Short-chain alcohol dehydrogenase family 

amino acids 37-49 and 114-124 
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FIGURE 76A 

GGAGGAGACAGCCTCCTGGGGGGCAGGGGTTCCCTGCCTCTGCTGCTCCTGCTCATC&ISGG 

AGGCATGGCTCAGGACTCCCCGCCCCAGATCCTAGTCCACCCCCAGGACCAGCTGTTCCAGG 

GCCCTGGCCCTGCCAGGATGAGCTGCCAAGCCTCAGGCCAGCCACCTCCCACCATCCGCTGG 

TTGCTGAATGGGCAGCCCCTGAGCATGGTGCCCCCAGACCCACACCACCTCCTGCCTGATGG 

GACCCTTCTGCTGCTACAGCGCCCTGCCCGGGGACATGCCCACGATGGCCAGGCCCTGTCCA 

CAGACCTGGGTGTCTACACATGTGAGGCCAGCAACCGGCTTGGCACGGCAGTCAGCAGAGGC 

GCTCGGCTGTCTGTGGCTGTCCTCCGGGAGGATTTCCAGATCCAGCCTCGGGACATGGTGGC 

TGTGGTGGGTGAGCAGTTTACTCTGGAATGTGGGCCGCCCTGGGGCCACCCAGAGCCCACAG 

TCTCATGGTGGAAAGATGGGAAACCCCTGGCCCTCCAGCCCGGAAGGCACACAGTGTCCGGG 
GGGTCCCTGCT GAT G G C AAG AGC AG AG AAGAG T GACGAAGG GACC T AC AT GTGTG T GGCCAC 

CAACAGCGCAGGACATAGGGAGAGCCGCGCAGCCCGGGTTTCCATCCAGGAGCCCCAGGACT 
ACACGGAGCCTGTGGAGCTTCTGGCTGTGCGAATTCAGCTGGAAAATGTGACACTGCTGAAC 
CCGGATCCTGCAGAGGGCCCCAAGCCTAGACCGGCGGTGTGGCTCAGCTGGAAGGTCAGTGG 
CCCTGCT.GCGCCTGCCCAATCTTACACGGCCTTGTTCAGGACCCAGACTGCCCCGGGAGGCC 
AGGGAGCTCCGTGGGCAGAGGAGCTGCTGGCCGGCTGGCAGAGCGCAGAGCTTGGAGGCCTC 
CACTGGGGCCAAGACTACGAGTTCAAAGTGAGACCATCCTCTGGCCGGGCTCGAGGCCCTGA 
CAGCAACGTGCTGCTCCTGAGGCTGCCGGAAAAAGTGCCCAGTGCCCCACCTCAGGAAGTGA 
CTCTAAAGCCTGGCAATGGCACTGTCTTTGTGAGCTGGGTCCCACCACCTGCTGAAAACCAC 
AATGGCATCATCCGTGGCTACCAGGTCTGGAGCCTGGGCAACACATCACTGCCACCAGCCAA 
CTGGACTGTAGTTGGTGAGCAGACCCAGCTGGAAATCGCCACCCATATGCCAGGCTCCTACT 
GCGTGCAAGTGGCTGCAGTCACTGGTGCTGGAGCTGGGGAGCCCAGTAGACCTGTCTGCCTC 
C T T T TAG AGC AG G C CAT G GAG C GAG C C AC C CAAGAACCCAG T GAGCAT GGTCCCT GGAC CC T 

GGAGCAGCTGAGGGCTACCTTGAAGCGGCCTGAGGTCATTGCCACCTGCGGTGTTGCACTCT 
GGCTGCTGCTTCTGGGCACCGCCGTGTGTATCCACCGCCGGCGCCGAGCTAGGGTGCACCTG 
G G C C C AG G T C T G T AC AGAT AT AC C AG T GAGGAT GC CAT C C TAAAAC AC AG GAT GGATC ACAG 

TGACTCCCAGTGGTTGGCAGACACTTGGCGTTCCACCTCTGGCTCTCGGGACCTGAGCAGCA 
GCAGCAGCCTCAGCAGTCGGCTGGGGGCGGATGCCCGGGACCCACTAGACTGTCGTCGCTCC 
TTGCTCTCCTGGGACTCCCGAAGCCCCGGCGTGCCCCTGCTTCCAGACACCAGCACTTTTTA 
TGGCTCCCTCATCGCTGAGCTGCCCTCCAGTACCCCAGCCAGGCCAAGTCCCCAGGTCCCAG 
CTGTCAGGCGCCTCCCACCCCAGCTGGCCCAGCTCTCCAGCCCCTGTTCCAGCTCAGACAGC 
CTCTGCAGCCGCAGGGGACTCTCTTCTCCCCGCTTGTCTCTGGCCCCTGCAGAGGCTTGGAA 
GGCCAAAAAGAAGCAGGAGCTGCAGCATGCCAACAGTTCCCCACTGCTCCGGGGCAGCCACT 
CCTTGGAGCTCCGGGCCTGTGAGTTAGGAAATAGAGGTTCCAAGAACCTTTCCCAAAGCCCA 
GGAGCTGTGCCCCAAGCTCTGGTTGCCTGGCGGGCCCTGGGACCGAAACTCCTCAGCTCCTC 
AAATGAGCTGGTTACTCGTCATCTCCCTCCAGCACCCCTCTTTCCTCATGAAACTCCCCCAA 
CTCAGAGTCAACAGACCCAGCCTCCGGTGGCACCACAGGCTCCCTCCTCCATCCTGCTGCCA 
GCAGCCCCCATCCCCATCCTTAGCCCCTGCAGTCCCCCTAGCCCCCAGGCCTCTTCCCTCTC 
TGGCCCCAGCCCAGCTTCCAGTCGCCTGTCCAGCTCCTCACTGTCATCCCTGGGGGAGGATC 
AAGACAGCGTGCTGACCCCTGAGGAGGTAGCCCTGTGCTTGGAACTCAGTGAGGGTGAGGAG 
ACTCCCAGGAACAGCGTCTCTCCCATGCCAAGGGCTCCTTCACCCCCCACCACCTATGGGTA 
CAT C AG C G T C C C AAC AG C C T CAGAG T T C AC GGAC ATGGGCAGGAC T GG AGGAG GGG T G GGGC 
CCAAGGGGGGAGTCTTGCTGTGCCCACCTCGGCCCTGCCTCACCCCCACCCCCAGCGAGGGC 
TCCTTAGCCAATGGTTGGGGCTCAGCCTCTGAGGACAATGCCGCCAGCGCCAGAGCCAGCCT 
TGTCAGCTCCTCCGATGGCTCCTTCCTCGCTGATGCTCACTTTGCCCGGGCCCTGGCAGTGG 
CTGTGGATAGCTTTGGTTTCGGTCTAGAGCCCAGGGAGGCAGACTGCGTCTTCATAGATGCC 
TCATCACCTCCCTCCCCACGGGATGAGATCTTCCTGACCCCCAACCTCTCCCTGCCCCTGTG 
GGAGTGGAGGCCAGACTGGTTGGAAGACATGGAGGTCAGCCACACCCAGCGGCTGGGAAGGG 
GGATGCCTCCCTGGCCCCCTGACTCTCAGATCTCTTCCCAGAGAAGTCAGCTCCACTGTCGT 
ATGCCCAAGGCTGGTGCTTCTCCTGTAGATTACTCCIG&ACCGTGTCCCTGAGACTTCCCAG 
ACGGGAATCAGAACCACTTCTCCTGTCCACCCACAAGACCTGGGCTGTGGTGTGTGGGTCTT 

GGCCTGTGTTTCTCTGCAGCTGGGGTCCACCTTCCCAAGCCTCCAGAGAGTTCTCCCTCCAC 
GAT T G T G AAAAC AAAT G AAAACAAAAT T AGAGC AAAG C T GAC C T GGAG C C C T C AGG GAGC AA 

AACATCATCTCCACCTGACTCCTAGCCACTGCTTTCTCCTCTGTGCCATCCACTCCCACCAC 
CAGGTTGTTTTGGCCTGAGGAGCAGCCCTGCCTGCTGCTCTTCCCCCACCATTTGGATCACA 
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FIGURE 76B 



GGAAGTGGAGGAGCCAGAGGTGCCTTTGTGGAGGACAGCAGTGGCTGCTGGGAGAGGGCTGT 
GGAGGAAGGAGCTTCTCGGAGCCCCCTCTCAGCCTTACCTGGGCCCCTCCTCTAGAGAAGAG 
CTCAACTCTCTCCCAACCTCACCATGGAAAGAAAATAATTATGAATGCCACTGAGGCACTGA 

GGCCCTACCTCATGCCAAACAAAGGGTTCAAGGCTGGGTCTAGCGAGGATGCTGAAGGAAGG 
GAGG TAT GAG AC C G TAG G T C AAAAGC ACCAT CC T C G TAC T G TT G T C AC T ATG AGC T T AAGAA 
ATT T GAT AC C ATAAAAT G G T AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



WO 99/46281 



PCT7US99/05028 



FIGURE 77 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA41404 
<subunit 1 of 1, 985 aa, 1 stop 
<MW: 105336, pi: 6.55, NX(S/T): 7 

MGGMAQDS PPQ I LVHPQDQL FQGPGPARMSCQASGQPPPTIRWLLNGQPLSMVPPDPHHLLP 
DGTLLLLQPPARGHAHDGQALSTDLGVYTCEASNRLGTAVSRGARLSVAVLREDFQIQPRDM 
VAWGEQFTLECGPPWGHPEPTVSWWKDGKPIAiQPGRHTVSGGSLimRAEKSDEGTYMCV 
ATNSAGHRESRAARVSIQEPQDYTEPVELLAWIQLEWTLLNPDPAEGPKPRPAVWLSWKV 
SGPAAPAQSYTALFRTQTAPGGQGAPWAEELLAGWQSAELGGLHWGQDYEFKVRPSSGRARG 
PDSWLLLRLPEKVPSAPPQEVTLKPGNGTVFVSWVPPPAENHNGIIRGYQVWSLGNTSLPP 
ANWTWGEQTQLEIATHMPGSYCVQVAAVTGAGAGEPSRPVCLLLEQAMERATQEPSEHGPW 
TLEQLRATLKRPEVIATCGVALWLLLLGTAVCIHRRRRARVHLGPGLYRYTSEDAILKHRMD 
HSDSQWLADTWRSTSGSRDLSSSSSLSSRLGADARDPLDCRRSLLSWDSRSPGVPLLPDTST 
FYGSLIAELPSSTPARPSPQVPAVRRLPPQLAQLSSPCSSSDSLCSRRGLSSPRLSLAPAEA 
WKAKKKQE LQHANS S PLLRGSHS LELRACELGNRGSKNLSQS P GAVP QAL VAWRALG PKLLS 

SSNELVTRHLPPAPLFPHETPPTQSQQTQPPVAPQAPSSILLPAAPIPILSPCSPPSPQASS 
LSGPSPASSRLSSSSLSSLGEDQDSVLTPEEV7U.CLELSEGEETPRNSVSPMPRAPSPPTTY 
GYISVPTASEFTDMGRTGGGVGPKGGVLLCPPRPCLTPTPSEGSLANGWGSASEDNAASARA 
S LVS SSDGS FL AD AH FARAL AVAVD S FG FGLE PRE ADC VFI DAS S PPS PRDE I FLTPNLSLP 

LWEWRPDWLEDMEVSHTQRLGRGMPPWPPDSQISSQRSQLHCRMPKAGASPVDYS 



Important features : 
Transmembrane domain: 

amino acids 4 48-467 



N-glycosylation sites: 

amino acids 224-227, 338-341, 367-370, 374-377, 658-661 and 926- 
929 

N-myristoylation sites. 

amino acids 47-52, 80-85, 88-93, 99-104, 105-110, 181-186, 272- 
277, 290-295, 355-360, 403-408, 462-467, 561-566, 652-657, 849- 
854 and 876-881 

Phospho tyrosine interaction domain proteins 

amino acids 740-753 
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FI GUR E 7 8 

CTCCCACGGTGTCCAGCGCCCAGAMSCGGCTTCTGGTCCTGCTATGGGGTTGCCTGCTGCT 
CCCAGGTTATGAAGCCCTGGAGGGCCCAGAGGAAATCAGCGGGTTCGAAGGGGACACTGTGT 
CCCTGCAGTGCACCTACAGGGAAGAGCTGAGGGACCACCGGAAGTACTGGTGCAGGAAGGGT 
GGGATCCTCTTCTCTCGCTGCTCTGGCACCATCTATGCAGAAGAAGAAGGCCAGGAGACAAT 
GAAGGGCAGGGTGTCCATCCGTGACAGCCGCCAGGAGCTCTCGCTCATTGTGACCCTGTGGA 
ACCTCACCCTGCAAGACGCTGGGGAGTACTGGTGTGGGGTCGAAAAACGGGGCCCCGATGAG 
TCTTTACTGATCTCTCTGTTCGTCTTTCCAGGACCCTGCtGTCCTCCCTCCCCTTCTCCCAC 
CTTCCAGCCTCTGGCTACAACACGCCTGCAGCCCAAGGCAAAAGCTCAGCAAACCCAGCCCC 
CAGGAT TGACT TC TCC TGGGCTCTACCCGGCAGCCACCACAGCCAAGCAGGGGAAGACAGGG 
GCTGAGGCCCCTCCATTGCCAGGGACTTCCCAGTACGGGCACGAAAGGACTTCTCAGTACAC 
AGGAACCTCTCCTCACCCAGCGACCTCTCCTCCTGCAGGGAGCTCCCGCCCCCCCATGCAGC 
TGGACTCCACCTCAGCAGAGGACACCAGTCCAGCTCTCAGCAGTGGCAGCTCTAAGCCCAGG 
GTGTCCATCCCGATGGTCCGCATACTGGCCCCAGTCCTGGTGCTGCTGAGCCTTCTGTCAGC 
CGCAGGCCTGATCGCCTTCTGCAGCCACCTGCTCCTGTGGAGAAAGGAAGCTCAACAGGCCA 
CGGAGACACAGAGGAACGAGAAGTTCTGGCTGTCACGCTTGACTGCGGAGGAAAAGGAAGCC 
CCTTCCCAGGCCCCTGAGGGGGACGTGATCTCGATGCCTCCCCTCCACACATCTGAGGAGGA 
GCTGGGCTTCTCGAAGTTTGTCTCAGCGISSGGCAGGAGGCCCTCCTGGCCAGGCCAGCAGT 
GAAGCAGTATGGCTGGCTGGATCAGCACCGATTCCCGAAAGCTTTCCACCTCAGCCTCAGAG 
TCCAGCTGCCCGGACTCCAGGGCTCTCCCCACCCTCCCCAGGCTCTCCTCTTGCATGTTCCA 
GCCTGACCTAGAAGCGTTTGTCAGCCCTGGAGCCCAGAGCGGTGGCCTTGCTCTTCCGGCTG 
G AGAC T G GG AC AT C C C T GAT AGG T TCACAT C C CT GGGCAGAG TAC CAGG C TG C T G AC C C T C A 

GCAGGGCCAGACAAGGCTCAGTGGATCTGGTCTGAGTTTCAATCTGCCAGGAACTCCTGGGC 
CTCATGCCCAGTGTCGGACCCTGCCTTCCTCCCACTCCAGACCCCACCTTGTCTTCCCTCCC 
TGGCGTCCTCAGACTTAGTCCCACGGTCTCCTGCATCAGCTGGTGATGAAGAGGAGCATGCT 
GGGGTGAGACTGGGATTCTGGCTTCTCTTTGAACCACCTGCATCCAGCCCTTCAGGAAGCCT 
GTGAAAAACGTGATTCCTGGCCCCACCAAGACCCACCAAAACCATCTCTGGGCTTGGTGCAG 
GACTCTGAATTCTAACAATGCCCAGTGACTGTCGCACTTGAGTTTGAGGGCCAGTGGGCCTG 
ATGAACGCTCACACCCCTTCAGCTTAGAGTCTGCATTTGGGCT.GTGACGTCTCCACCTGCCC 
CAATAGATCTGCTCTGTCTGCGACACCAGATCCACGTGGGGACTCCCCTGAGGCCTGCTAAG 
TCCAGGCCTTGGT CAGG T CAGG TGCAC AT TGCAGGATAAGCCCAGGACCGGCACAGAAGTGG 
TTGCCTTTNCCATTTGCCCTCCCTGGNCCATGCCTTCTTGCCTTTGGAAAAAATGATGAAGA 
AAACCTTGGCTCCTTCCTTGTCTGGAAAGGGTTACTTGCCTATGGGTTCTGGTGGCTAGAGA 
GAAAAG T AGAAAAC C AGAG T GC AC G T AGG T G T C T AACACAG AGGAGAG TAG GAAC AGGGCGG 
ATACCTGAAGGTGACTCCGAGTCCAGCCCCCTGGAGAAGGGGTCGGGGGTGGTGGTAAAGTA 
GCACAACTACTATTTTTTTTCTTTTTCCATTATTATTGTTTTTTAAGACAGAATCTCGTGCT 
GCTGCCCAGGCTGGAGTGCAGTGGCACGATCTGCAAACTCCGCCTCCTGGGTTCAAGTGATT 
CTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCACCACACCTGGCTAATT 
TTTGTACTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGAC 
CTCAAATGAGCCTCCTGCTTCAGTCTCCCAAATTGCCGGGATTACAGGCATGAGCCACTGTG 
TCTGGCCCTATTTCCTTTAAAAAGTGAAATTAAGAGTTGTTCAGTATGCAAAACTTGGAAAG 
A T G GAG G AG AAAAAG AAAAG GAAGAAAAAAAT G T C AC C CAT AG T C T C AC CAGAGAC TAT CAT 

TATTTCGTTTTGTTGTACTTCCTTCCACTCTTTTCTTCTTCACATAATTTGCCGGTGTTCTT 

TTTACAGAGCAATTATCTTGTATATACAACTTTGTATCCTGCCTTTTCCACCTTATCGTTCC 

ATCACTTTATTCCAGCACTTCTCTGTGTTTTACAGACCTTTTTATAAATAAAATGTTCATCA 
GCTGCATAAAAAAAAAAAAAA 
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FIGURE 79 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss , DNA44196 
<subunit 1 of 1, 332 aa, 1 stop 
<MW: 36143, pi: 5.89, NX(S/T): 1 

MRLLVLLWGCLLLPGYEALEGPEEISGFEGDTVSLQCTYREELRDHRKYWCRKGGILFSRCS 
GTIYAEEEGQETMKGRVSIRDSRQELSLIVTLWNLTLQDAGEYWCGVEKRGPDESLLISLFV 



FPGPCCPPSPSPTFQPLATTRLQPKAKAQQTQPPGLTSPGLYPAATTAKQGKTGAEAPPLPG 

TSQYGHERTSQYTGTSPHPATSPPAGSSRPPMQLDSTSAEDTSPALSSGSSKPRVSIPMVRI 

LAPVLVLLSLLSAAGLIAFCSHLLLWRKEAQQATETQRNEKFWLSRLTAEEKEAPSQAPEGD 
VISMPPLHTSEEELGFSKFVSA 

Important features: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 
amino acids 248-269 

N-glycosyiation site. 

amino acids 9 6-99 

Fibrinogen beta and gamma chains C- terminal domain. 

amino acids 104-113 

Ig like V-type domain: 
amino acids 13-128 
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FIGURE 80 

TTGTGACTAAAAGCTGGCCTAGCAGGCCAGGGAGTGCAGCTGCAGGCGTGGGGGTGGCAGGA 

GCCGCAGAGCCAGAGCAGACAGCCGAGAAACAGGTGGACAGTGTGAAAGAACCAGTGGTCTC 

GCTCTGTTGCCCAGGCTAGAGTGTACTGGCGTGATCATAGCTCACTGCAGCCTCAGACTCCT 

GGACTTGAGAAATCCTCCTGCCTTAGCCTCCTGCATATCTGGGACTCCAGGGGTGCACTCAA 

GCCCTGTTTCTTCTCCTTCTGTGAGTGGACCACGGAGGCTGGTGAGCTGCCTGTCATCCCAA 

AGCTCAGCTCTGAGCCAGAGTGGTGGTGGCTCCACCTC7GCCGCCGGCATAGAAGCCAGGAG 

CAGGGCTCTCAGAAGGCGGTGGTGCCCAGCTGGGATC^ISTTGTTGGCCCTGGTCTGTCTGC 

TCAGCTGCCTGCTACCCTCCAGTGAGGCCAAGCTCTACGGTCGTTGTGAACTGGCCAGAGTG 

CTACATGACTTCGGGCTGGACGGATACCGGGGATACAGCCTGGCTGACTGGGTCTGCCTTGC 

TTATTTCACAAGCGGTTTCAACGCAGCTGCTTTGGACTACGAGGCTGATGGGAGCACCAACA 

ACGGGATCTTCCAGATCAACAGCCGGAGGTGGTGCAGCAACCTO^CCCCGAACGTCCCCAAC 

GTGTGCCGGATGTACTGCTCAGATTTGTTGAATCCTAATCTCAAGGATACCGTTATCTGTGC 

CATGAAGATAACCCAAGAGCCTCAGGGTCTGGGTTACTGGGAGGCCTGGAGGCATCACTGCC 
AG G G AAAAG AC C T GAG T GAAT GGG T GGATGGC T G T GAC T T C TACJG AT GGAC GGAACC ATGCA 

CAGCAGGCTGGGAAATGTGGTTTGGTTCCTGACCTAGGCTTGGGAAGACAAGCCAGCGAATA 
AAGGATGGTTGAACGTGAAA 
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FIGURE 81 



</usr/seqdb2/sst/DNA/Dnaseqs.iuin/ss.DNA52187 
<subunit 1 of 1, 14 6 aa, 1 stop 
<MW: 16430, pi: 5.05, NX(S/T): 1 
MLLALVCLLSCLLPSSEAKLYG 

YEADGSTNNGIFQINSRRWCSNLTPNVPNVCRMYCSDLLNPNLKDTVICAMKITQEPQGLGY 
WEAWRHHCQGKDLTEWVDGCDF 

Important features : 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 67-72 

Homolgous region to Alpha-lactalbumin / lysozyme C proteins. 

amino acids 34-58 (catalytic domain) , 111-132 and 66-107 
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FIGURE 82 

AGCCGCTGCCCCGGGCCGGGCGCCCGCGGCGGCACCi^IGAGTCCCCGCTCGTGCCTGCGTTC 
GCTGCGCCTCCTCGTCTTCGCCGTCTTCTCAGCCGCCGCGAGCAA.CTGGCTGTACCTGGCCA 
AGCTGTCGTCGGTGGGGAGCATCTCAGAGGAGGAGACGTGCGAGAAACTCAAGGGCCTGATC 
CAGAGGCAGGTGCAGATGTGCAAGCGGAACCTGGAAGTCATGGACTCGGTGCGCCGCGGTGC 
CCAGCTGGCCATTGAGGAGTGCCAGTACCAGTTCCGGAACCGGCGCTGGAACTGCTCCACAC 
TCGACTCCTTGCCCGTCTTCGGCAAGGTGGTGACGCAAGGGACTCGGGAGGCGGCCTTCGTG 
TACGCCATCTCTTCGGCAGGTGTGGCCTTTGCAGTGACGCGGGCGTGCAGCAGTGGGGAGCT 
GGAGAAGTGCGGCTGTGACAGGACAGTGCATGGGGTCAGCCCACAGGGCTTCCAGTGGTCAG 
GATGCTCTGACAACATCGCCTACGGTGTGGCCTTCTCACAGTCGTTTGTGGATGTGCGGGAG 
AGAAGCAAGGGGGCCTCGTCCAGCAGAGCCCTCATGAACCTCCACAACAATGAGGCCGGCAG 
GAAGGCCATCCTGACACACATGCGGGTGGAATGCAAGTGCCACGGGGTGTCAGGCTCCTGTG 
AGGTAAAGACGTGCTGGCGAGCCGTGCCGCCCTTCCGCCAGGTGGGTCACGCACTGAAGGAG 
AAGTTTGATGGTGCCACTGAGGTGGAGCCACGCCGCGTGGGCTCCTCCAGGGCACTGGTACC 
AC G C AAC G C AC AG T T C AAGC C GC AC AC AGAT GAG GAC C T GG T G T AC T T GGAG C CTAGCCC CG 

ACTTCTGTGAGCAGGACATGCGCAGCGGCGTGCTGGGCACGAGGGGCCGCACATGCAACAAG 
ACGTCCAAGGCCATCGACGGCTGTGAGCTGCTGTGCTGTGGCCGCGGCTTCCACACGGCGCA 
GGTGGAGCTGGCTGAACGCTGCAGCTGCAAATTCCACTGGTGCTGCTTCGTCAAGTGCCGGC 
AGTGCCAGCGGCTCGTGGAGTTGCACACGTGCCGAIS^CCGCCTGCCTAGCCCTGCGCCGGC 
AACCACCTAGTGGCCCAGGGAAGGCCGATAATTTAAACAGTCTCCCACCACCTACCCCAAGA 
GATACTGGTTGTATTTTTTGTTCTGGTTTGGTTTTTGGGTCCTCATGTTATTTATTGCCGAA 
ACCAGGCAGGCAACCCCAAGGGCACCAACCAGGGCCTCCCCAAAGCCTGGGCCTTTGTGGCT 
GCCACTGACCAAAGGGACCTTGCTCGTGCCGCTGGCTGCCCGCATGTGGCTGCCACTGACCA 
C T C AG T T G T TAT CTGTGTCCGTTTTTC T AC T T GC AGACC TAAGG T GGAG T AACAAGGAGTAT 
TACCACCACATGGCTACTGACCGTGTCATCGGGGAAGAGGGGGCCTTATGGCAGGGAAAATA 
G G T AC C GAC T T G A T G G AAG T C AC AC C C T C T G G AAAAAAGAAC T C T T AAC T C T C C AG C ACAC A 

TACACATGGACTCCTGGCAGCTTGAGCCTAGAAGCCATGTCTCTCAAATGCCCTGAGAAAGG 
GAACAAGCAGATACCAGGTCAAGGGCACCAGGTTCATTTCAGCCCTTACATGGACAGCTAGA 
GGTTCGATATCTGTGGGTCCTTCCAGGCAAGAAGAGGGAGATGAGAGCAAGAGACGACTGAA 
GTCCCACCCTAGAACCCAGCCTGCCCCAGCCTGCCCCTGGGAAGAGGAAACTTAACCACTCC 
CC AGAC C C AC C T AG GC AG GC AT AT AG GC T GC CAT CCT GGAC CAGG GAT CCCGGCTGTGCCTT 
TGCAGTCATGCCCGAGTCACCTTTCACAGCGCTGTTCCTCCATGAAACTGAAAAACACACAC 
AC AC AC ACAC ACACAC AC AC AC AC ACACACACAC AC AC GGAC AC AC AC AC ACACC T G CGAGA 
GAGAGGGAGGAAAGGGCTGTGCCTTTGCAGTCATGCCCGAGTCACCTTTCACAGCACTGTTCCTC 
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FIGURE 83 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48328 
<subunit 1 of 1, 351 aa, 1 stop 
<MW: 39052, pi: 8.97, NX(S/T): 2 

MSPRSCLRSLRLLVFAVFSAAASNWLYIJ^LSSVGSISEEETCEKLKGLIQRQVQMCKRNLE 
VMDSVRRGAQLAIEECQYQFRNRRWNCSTLDSLPVFGKWTQGTREAAFVYAI^ 

TRACSSGELEKCGCDRTVHGVSPQGFQWSGCSDNIAYGVAFSQSFVDVRERSKGASSSRALM 
NLHNNEAGRKAILTHMRVECKCHGVSGSCEVKTCWRAVPPFRQVGHALKEKFDGATEVEPRR 

VGSSR2ULVPRNAQFKPHTDEDLVYLEPSPDFCEQDMRSGVLGTRGRTCNKTSKAIDGCELLC 
CGRGFHTAQVELAERCSCKFHWCCFVKCRQCQRLVELHTCR 

Important features: 
Signal peptide: 

amino acids 1-22 

N-glycosylation sites. 

amino acids 88-91 and 297-300 

Wnt-1 family signature. 

amino acids 206-215 

Homologous region to Wnt-1 family proteins 

amino acids 183-235, 305-350, 97-138, 53-92 and 150 -174 
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FIGURE 84 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGGTGCCTGCAT 
CGCC&2SGACACCACCAGGTACAGCAAGTGGGGCGGCAGCTCCGAGGAGGTCCCCGGAGGGC 
CCTGGGGACGCTGGGTGCACTGGAGCAGGAGACCCCTCTTCTTGGCCCTGGCTGTCCTGGTC 
ACCACAGTCCTTTGGGCTGTGATTCTGAGTATCCTATTGTCCAAGGCCTCCACGGAGCGCGC 
GGCGCTGCTTGACGGCCACGACCTGCTGAGGACAAACGCCTCGAAGCAGACGGCGGCGCTGG 
GTGCCCTGAAGGAGGAGGTCGGAGACTGCCACAGCTGCTGCTCGGGGACGCAGGCGCAGCTG 
CAGACCACGCGCGCGGAGCTTGGGGAGGCGCAGGCGAAGCTGATGGAGCAGGAGAGCGCCCT 
GCGGGAACTGCGTGAGCGCGTGACCCAGGGCTTGGCTGAAGCCGGCAGGGGCCGTGAGGACG 
TCCGCACTGAGCTGTTCCGGGCGCTGGAGGCCGTGAGGCTCCAGAACAACTCCTGCGAGCCG 
TGCCCCACGTCGTGGCTGTCCTTCGAGGGCTCCTGCTACTTTTTCTCTGTGCCAAAGACGAC 
GTGGGCGGCGGCGCAGGATCACTGCGCAGATGCCAGCGCGCACCTGGTGATCGTTGGGGGCC 
TGGATGAGCAGGGCTTCCTCACTCGGAACACGCGTGGCCGTGGTTACTGGCTGGGCCTGAGG 
GCTGTGCGCCATCTGGGCAAGGTTCAGGGCTACCAGTGGGTGGACGGAGTCTCTCTCAGCTT 
CAGCCACTGGAACCAGGGAGAGCCCAATGACGCTTGGGGGCGCGAGAACTGTGTCATGATGC 
TGCACACGGGGCTGTGGAACGACGCACCGTGTGACAGCGAGAAGGACGGCTGGATCTGTGAG 
AAAAGGCACAACTGC1SACCCCGCCCAGTGCCCTGGAGCCGCGCCCATTGCAGCATGTCGTA 
TCCTGGGGGCTGCTCACCTCCCTGGCTCCTGGAGCTGATTGCCAAAGAGTTTTTTTCTTCCT 
CATCCACCGCTGCTGAGTCTCAGAAACACTTGGCCCAACATAGCCCTGTCCAGCCCAGTGCC 
TGGGCTCTGGGACCTCCATGCCGACCTCATCCTAACTCCACTCACGCAGACCCAACCTAACC 
TCCACTAGCTCCAAAATCCCTGCTCCTGCGTCCCCGTGATATGCCTCCACTTCTCTCCCTAA 
CCAAGGTTAGGTGACTGAGGACTGGAGCTGTTTGGTTTTCTCGCATTTTCCACCAAACTGGA 
AGCTGTTTTTGCAGCCTGAGGAAGCATCAATAAATATTTGAGAAATGAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA56352 

<subunit 1 of 1/ 293 aa, 1 stop 

<MW: 32562, pi: 6.53, NX(S/T): 2 

MDT TRY S KWGG S SEE VPGGP WGRWHW SRRPL FI^^ 

LLDGHDLLRTNASKQTAALGALKEEVGDCHSCCSGTQAQLQTTRAELGEAQAKI^EQESALR 
ELRERVTQGLAEAGRGREDVRTELFRALEAVRLQNNSCEPCPTSWLSFEGSCYFFSVPKTTW 
AAAQDHCADASAHLVIVGGLDEQGFLTRNTRGRGYWLGLRAVRHLGKVQGYQWVDGVSLSFS 
HWNQGE PNDAWGRENC VMMLHT GLWNDAPCDS EKDGW I CEKRHNC 

Important features: 

Type II transmembrane domain: 

amino acids 31-54 

N-glycosylation sites . 

amino acids 73-76 and 159-162 

Leucine zipper pattern, 

amino acids 102-123 

N -myri s toy 1 a ti on s i tes . 

amino acids 18-23, 133-138 and 242-247 

C-type lectin domain signature. 

amino acids 264-287 
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GCCAGGGGAAGAGGGTGATCCGACCCGGGGAAGGTCGCTGGGCAGGGCGAGTTGGGAAAGCG 

GCAGCCCCCGCCGCCCCCGCAGCCCCTTCTCCTCCTTTCTCCCACGTCCTATCTGCCTCTCG 

CTGGAGGCCAGGCCGTGCAGCATCGAAGACAGGAGGAACTGGAGCCTCATTGGCCGGCCCGG 

GGCGCCGGCCTCGGGCTTAAATAGGAGCTCCGGGCTCTGGCTGGGACCCGACCGCTGCCGGC 

CGCGCTCCCGCTGCTCCTGCCGGGTGaiSGAAAACCCCAGCCCGGCCGCCGCCCTGGGCAAG 

GCCCTCTGCGCTCTCCTCCTGGCCACTCTCGGCGCCGCCGGCCAGCCTCTTGGGGGAGAGTC 

CATCTGTTCCGCCAGAGCCCCGGCCAAATACAGCATCACCTTCACGGGCAAGTGGAGCCAGA 

CGGCCTTCCCCAAGCAGTACCCCCTGTTCCGCCCCCCTGCGCAGTGGTCTTCGCTGCTGGGG 

GCCGCGCATAGCTCCGACTACAGCATGTGGAGGAAGAACCAGTACGTCAGTAACGGGCTGCG 

CGACTTTGCGGAGCGCGGCGAGGCCTGGGCGCTGATGAAGGAGATCGAGGCGGCGGGGGAGG 

CGCTGCAGAGCGTGCACGAGGTGTTTTCGGCGCCCGCCGTCCCCAGCGGCACCGGGCAGACG 

TCGGCGGAGCTGGAGGTGCAGCGCAGGCACTCGCTGGTCTCGTTTGTGGTGCGCATCGTGCC 

CAGCCCCGACTGGTTCGTGGGCGTGGACAGCCTGGACCTGTGCGACGGGGACCGTTGGCGGG 

AACAGGCGGCGCTGGACCTGTACCCCTACGACGCCGGGACGGACAGCGGCTTCACCTTCTCC 

TCCCCCAACTTCGCCACCATCCCGCAGGACACGGTGACCGAGATAACGTCCTCCTCTCCCAG 

CCACCCGGCCAACTCCTTCTACTACCCGCGGCTGAAGGCCCTGCCTCCCATCGCCAGGGTGA 

CACTGCTGCGGCTGCGACAGAGCCCCAGGGCCTTCATCCCTCCCGCCCCAGTCCTGCCCAGC 

AGGGACAATGAGATTGTAGACAGCGCCTCAGTTCCAGAAACGCCGCTGGACTGCGAGGTCTC 

CCTGTGGTCGTCCTGGGGACTGTGCGGAGGCCACTGTGGGAGGCTCGGGACCAAGAGCAGGA 

CTCGCTACGTCCGGGTCCAGCCCGCCAACAACGGGAGCCCCTGCCCCGAGCTCGAAGAAGAG 

GCTGAGTGCGTCCCTGATAACTGCGTCiaAGACCAGAGCCCCGCAGCCCCTGGGGCCCCCCG 

GAGCCATGGGGTGTCGGGGGCTCCTGTGCAGGCTCATGCTGCAGGCGGCCGAGGGCACAGGG 

GGTTTCGCGCTGCTCCTGACCGCGGTGAGGCCGCGCCGACCATCTCTGCACTGAAGGGCCCT 

CTGGTGGCCGGCACGGGCATTGGGAAACAGCCTCCTCCTTTCCCAACCTTGCTTCTTAGGGG 

CCCCCGTGTCCCGTCTGCTCTCAGCCTCCTCCTCCTGCAGGATAAAGTCATCCCCAAGGCTC 

CAGCTACTCTAAATTATGTCTCCTTATAAGTTATTGCTGCTCCAGGAGATTGTCCTTCATCG 

TCCAGGGGCCTGGCTCCCACGTGGTTGCAGATACCTCAGACCTGGTGCTCTAGGCTGTGCTG 

AGCCCACTCTCCCGAGGGCGCATCCAAGCGGGGGCCACTTGAGAAGTGAATAAATGGGGCGG 

TTTCGGAAGCGTCAGTGTTTCCATGTTATGGATCTCTCTGCGTTTGAATAAAGACTATCTCT 

GTTGCTCACAAAAAAAAAAAAAAAAAAAAAA?VAAAAAAAAAAAAAAJ^ 



&j Z2>=h 
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FIGURE 87 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA53971 
xsubunit 1 of 1, 331 aa, 1 stop 
><MW: 35844, pi: 5.45, NX(S/T): 2 

MENPS PAAALGKALCALLLATLGAAGQPLGGES ICSARAPAKYS ITFTGKW5QTAFPKQYPL 
FRPPAQWS S LLGAAHS S DYSMWRKNQYVSNGLRDFAERGEAWALMKE IEAAGEALQS VHEVF 

SAPAVPSGTGQTSAELEVQRRHSLVSFVVRIVPSPDWFVGVDSLDLCDGDRWREQAALDLYP 
YDAGTDS G FT FS S PNFAT I PQDTVTE I TS S S PSHPANS FYYPRLKALPP IARVTLLRLRQS P 

RAFIPPAPVLPSRDNEIVDSASVPETPLDCEVSLWSSWGLCGGHCGRLGTKSRTRYVRVQPA 
NNGSPCPELEEEAECVPDNCV 

Important features: 
Signal peptide: 

amino acids 1-2 6 





m 
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FIGURE 88 



GGCGGCGTCCGTGAGGGGCTCCTTTGGGCAGGGGTAGTGTTTGGTGTCCCTGTCTTGCGTGA 
TATTGACAAACTGAAGCTTTCCTGCACCACTGGACTTAAGGAAGAGTGTACTCGTAGGCGGA 
CAGCTTTAGTGGCCGGCCGGCCGCTCTCATCCCCCGTAAGGAGCAGAGTCCTTTGTACTGAC 
CAAGAISAGCAACATCTACATCCAGGAGCCTCCCACGAATGGGAAGGTTTTATTGAAAACTA 
CAGCTGGAGATATTGACATAGAGTTGTGGTCCAAAGAAGCTCCTAAAGCTTGCAGAAATTTT 
ATCCAACTTTGTTTGGAAGCTTATTATGACAATACCATTTTTCATAGAGTTGTGCCTGGTTT 
C ATAG T C C AAGG C G G AGAT C C TAG T GGCACAG GGAG T GG T G GAGAGTC T ATCTATGGAGC G C 
CATT CAAAG AT G AAT T T CAT T C ACGG T T GC G T TTTAAT C GGAGAGGAC TGGTT GCC ATGGC A 
AATGCTGGTTCTCATGATAATGGCAGCCAGTTTTTCTTCACACTGGGTCGAGCAGATGAACT 
TAACAATAAGCATACCATCTTTGGAAAGGTTACAGGGGATACAGTATATAACATGTTGCGAC 
TGTCAGAAGTAGACATTGATGATGACGAAAGLACCACATAATCCACACAAAATAAAAAGCTGT 
GAGGTTTTGTTTAATCCTTTTGATGACATCATTCCAAGGGAAATTAAAAGGCTGAAAAAAGA 
GAAAC C AG AG GAG GAAG T AAAGAAAT T GAAAC C CAAAG G C ACAAAAAATT TTAG T T T ACT T T 
CAT T T GGAGAG G AAGC T GAG GAAGAAGAGGAGGAAG T AAAT CGAG T TAG T CAGAGCAT GAAG 
GG C AAAAG C AAAAG TAG T CAT GAC T TG C T TAAGGATG AT C C AC AT C TCAGT T C T G T T CC AG T 
TGTAGAAAGTGAAAAAGGTGATGCACCAGATTTAGTTGATGATGGAGAAGATGAAAGTGCAG 
AGCATGATGAATATATTGATGGTGATGAAAAGAACCTGATGAGAGAAAGAATTGCCAAAAAA 
TTAAAAAAGGACACAAGTGCGAATGTTAAATCAGCTGGAGAAGGAGAAGTGGAGAAGAAATC 
AGTCAGCCGCAGTGAAGAGCTCAGAAAAGAAGCAAGACAATTAAAACGGGAACTCTTAGCAG 
C AAAAC AAAAAAAAG T AG AAAAT GC AG CAAAACAAGCAG AAAAAAGAAG T GAAGAGG AAGAA 
GCCCCTCCAGATGGTGCTGTTGCCGAATACAGAAGAGAAAAGCAAAAGTATGAAGCTTTGAG 
GAAG C AAC AG T CAAAG AAG GGAAC T T C CCGGGAAGAT C AGAC CC T TGCAC T GC T GAAC CAG T 
T T AAAT C TAAAC T C AC T CAAGCAAT T GC TG AAACACC T GAAAAT GACAT T CC T G AAAC AGAA 
GTAGAAGATGATGAAGGATGGATGTCACATGTACTTCAGTTTGAGGATAA7^AGCAGAAAAGT 
GAAAGATGCAAGCATGCAAGACTCAGATACATTTGAAATCTATGATCCTCGGAATCCAGTGA 
AT AAAAG AAGG AGG GAAG AAAGCAAAAAGC T GAT G AGAGAGAAAAAAG AAAGAAG A.JAAAAT 

GAGAATAATGATAACCAGAACTTGCTGGAAATGTGCCTACAATGGCCTTGTAACAGCCATTG 
TTCCCAACAGCATCACTTAGGGGTGTGAAAAGAAGTATTTTTGAACCTGTTGTCTGGTTTTG 
AAAAACAATTATCTTGTTTTGCAAATTGTGGAATGATGTAAGCAAATGCTTTTGGTTACTGG 
TACAT GT G TTTTTTCC TAGC TGACCTT T TATATT GCTAAATCTGAAATAAAATAAC TTT C C T • 
TCCAGAAAAAAAAAAAAAAA 2 VAAAAAAAAAAAAA 



3ojzz=h 
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FIGURE 89 



x/usr/seqdto2/sst/DNA/Dnaseqs*min/ss.DNA50919 
xsubunit 1 of 1, 472 aa, 1 stop 
XMW: 53847, pi: 5.75, NX(S/T): 2 

MSNIYIQEPPTNGKVLLKTTAGDIDIELWSKEAPKACRNFIQLCLEAYYDNTIFHRVVPGFI 

VQGGDPTGTGSGGESIYGAPFKDEFHSRLRFNRRGLVAMANAGSHDNGSQFFFTLGRADELN 
NKHTIFGKVTGDTVYI^LRLSEVD^^ 

PEEEVKKLKPKGTKNFSLLSFGEEAEEEEEEVNRVSQSMKGKSKSSHDLLKDDPHLSSVPW 

ESEKGDAPDLVDDGEDESAEHDEYIDGDEKNLMRERIAKKLKKDTSANV^SAGEGEVEKKSV 

SRSEELRKEARQLKRELLAAKQKKVENAAKQAEKRSEEEEAPPDGAVAEYRREKQKYEALRK 

QQSKKGTSREDQTLALLNQFKSKLTQAIAETPENDIPETEVEDDEGWMSHVLQFEDKSRKVK 
DASMQDS DT FE I YDPRNPVNKRRREESKKLMREKKERR 

Important features : 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 109-112 and 201-204 

Cyclophilin-type peptidyl -prolyl cis-trans isomerase signature. 

amino acids 4 9-66 

Homologous region to Cyclophilin-type peptidyl -prolyl cis-trans 
isomerase 

amino acids 96-140, 49-89 and 22-51 
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FIGURE 90 

CGCCGCCGTTGGGGCTGGAAGTTCCCGCCAGGTCCGTGCCGGGCGAGAGAGATGCTGCCCGG 

CCCGCCTCGGCTTTGAGGCGAGAGAAGTGTCCCAGACCCATTTCGCCTTGCTGACGGCGTCG 

AGCCCTGGCCAGAC&2STCCACAGGGTTCTCCTTCGGGTCCGGGACTCTGGGCTCCACCACC 

GTGGCCGCCGGCGGGACCAGCACAGGCGGCGTTTTCTCCTTCGGAACGGGAACGTCTAGCAA 

CCCTTCTGTGGGGCTCAATTTTGGAAATCTTGGAAGTACTTCAACTCCAGCAACTACATCTG 

CTCCTTCAAGTGGTTTTGGAACCGGGCTCTTTGGATCTAAACCTGCCACTGGGTTCACTCTA 

GGAGGAACAAATACAGGTGCCTTGCACACCAAGAGGCCTCAAGTGGTC7VCCAAATATGGAAC 

CCTGCAAGGAAAACAGATGGATGTGGGGAAGACACCCATCCAAGTCTTTTTAGGAGTCCCCT 

TCTCCAGACCTCCTCTAGGTATCCTCAGGTTTGCACCTCCAGAACCCCCGGAGCCCTGGAAA 

GGAATCAGAGATGCTACCACCTACCCGCCTGGATGGAGTCTCGCTCTGTCGCCAGGCTGGAG 

TGCAGTGGCACGATCTCGGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGCGAGTCTCCTGC 

CTCAGCCTCTGAGTGTCTGGGGCTACAGGTGCCTGCAGGAGTCCTGGGGCCAGCTGGCCTCG 

ATGTACGTCAGCACGCGGGAACGGTACAAGTGGCTGCGCTTCAGCGAGGACTGTCTGTACCT 

GAACGTGTACGCGCCGGCGCGCGCGCCCGGGGATCCCCAGCTGCCAGTGATGGTCTGGTTCC 

CGGGAGGCGCCTTCATCGTGGGCGCTGCTTCTTCGTACGAGGGCTCTGACTTGGCCGCCCGC 

GAGAAAGTGGTGCTGGTGTTTCTGCAGCACAGGCTCGGCATCTTCGGCTTCCTGAGCACGGA 

CGACAGCCACGCGCGCGGGAACTGGGGGCTGCTGGACCAGATGGCGGCTCTGCGCTGGGTGC 

AGGAGAAC AT C G C AG C C T T C GGGGGAGAC C C AGGAAAT G T GACCCT G T T CGGCC AG T C GGC G 

GGGGCCATGAGCATCTCAGGACTGATGATGTCACCCCTAGCCTCGGGTCTCTTCCATCGGGC 
CATTTCCCAGAGTGGCACCGCGTTATTCAGACTTTTCATCACTAGTAACCCACTGAAAGTGG 
CCAAGAAGGTTGCCCACCTGGCTGGATGCAACCACAACAGCACACAGATCCTGGTAAACTGC 
CTGAGGGCACTATCAGGGACCAAGGTGATGCGTGTGTCCAACAAGATGAGATTCCTCCAACT 
GAACTTCCAGAGAGACCCGGAAGAGATTATCTGGTCCATGAGCCCTGTGGTGGATGGTGTGG 
TGATCCCAGATGACCCTTTGGTGCTCCTGACCCAGGGGAAGGTTTCATCTGTGCCCTACCTT 
CTAGGTGTCAACAACCTGGAATTCAATTGGCTCTTGCCTTATAATATCACCAAGGAGCAGGT 
ACCACTTGTGGTGGAGGAGTACCTGGACAATGTCAATGAGCATGACTGGAAGATGCTACGAA 
ACC G TAT GAT G G AC AT AG T T CAAGAT GCCAC T T TCG T G TAT GC CACAC T GCAGAC T G C T CAC 

TACCACCGAGAAACCCCAATGATGGGAATCTGCCCTGCTGGCCACGCTACAACAAGGATGAA 
AAG T AC C T G C AG C T G GAT T T T AC CACAAGAG T GGGCATG&AGC T C AAGGAGAAGAAGAT GGC 
T T T T T G GAT GAG T C T G T AC C AG TCT C AAAGAC C T G AGAAGCAGAGGCAATT C TAAGGG T GGC 
TATGCAGGAAGGAGCCAAAGAGGGGTTTGCCCCCACCATCCAGGCCCTGGGGAGACTAGCCA 
TGGACATACCTGGGGACAAGAGTTCTACCCACCCCAGTTTAGAACTGCAGGAGCTCCCTGCT 
GCCTCCAGGCCAAAGCTAGAGCTTTTGCCTGTTGTGTGGGACCTGCACTGCCCTTTCCAGCC 
TGACATCCCATGATGCCCCTCTACTTCACTGTTGACATCCAGTTAGGCCAGGCCCTGTCAAC 
ACCACACTGTGCTCAGCTCTCCAGCCTCAGGACAACCTCTTTTTTTCCCTTCTTCAAATCCT 
CCCACCCTTCAATGTCTCCTTGTGACTCCTTCTTATGGGAGGTCGACCCAGACTGCCACTGC 
CCCTGTCACTGCACCCAGCTTGGCATTTACCATCCATCCTGCTCAACCTTGTTCCTGTCTGT 
TC AC AT T GG C C T G GAG G C C T AGGG CAGG T TG T GAC AT GGAGCAAAC T T T T GG TAG T T T GGGA 
TCTTCTCTCC CAC C CACAC T TATC T C CCC CAGGG CC AC T C CAAAG T C TAT AC ACAGGGG T GG 
TCTC T TC AATAAAGAAG T G T TGAT TAGAAAAAAAAAAA 
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FIGURE 91 

</usr/seqdb2/sst/DNA/Dnaseqs .iain/ss *DNA4417 9 
<subunit 1 of 1, 545 aa, 1 stop 
<MW: 58934, pi: 9.45, NX(S/T): 4 

MSTGFSFGSGTLGSTTVAAGGTSTGGVFS FGTGTSSNPSVGLNFGNLGSTSTPATTSAPSSG 
FGTGLFGSKPATGFTLGGTNTGALHTKRPQWTKYGTLQGKQMHVGKTPIQVFLGVPFSRPP 
LGILRFAPPEPPEPWKGIRDATTYPPGWSLALSPGWSAVARSRLTATSASRVQASLLPQPLS 
WGYRCLQESWGQLASMYVSTRERYKWLRFSEDCLYLl^APARAPGDPQLPVMVWFPGGAF 

IVGAASSYEGSDLAAREKWLVFLQHRLGIFGFLSTDDSHARGNWGLLDQMAALRWQEIilA 
AFGGDPGNVTLFGQSAGAMSISGLMMSPLASGLFHRAISQ 

HLAGCNHNSTQILVNCLRALSGTKVMRVSNKMRFLQLNFQRDPEEITWSMSPWDGVVIPDD 

PLVLLTQGKVSSVPYLLGVNNLEFNWLLPYNITKEQVPLVVEEYLDNVNEHDW 

I VQDAT FVYAT LQ TAH YHRE T PMMG I CP AGHAT TRMKS T C 5 W I L P QEWA 

Important features: 
Signal peptide: 

amino acids 1-29 

Carboxyl esterases type-B serine active site. 

amino acids 312-327 

Carboxyl esterases type-B signature 2. 

amino acids 218-228 

N-glycosylation sites. 

amino acids 318-321, 380-383 and 465-468 



33/23? 
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GAGAACAGGCCTGTCTCAGGCAGGCCCTGCGCCTCCTATGCGGAGMSCTACTGCCACTGCT 
GCTGTCCTCGCTGCTGGGCGGGTCCCAGGCTATGGATGGGAGATTCTGGATACGAGTGCAGG 
AGTCAGTGATGGTGCCGGAGGGCCTGTGCATCTCTGTGCCCTGCTCTTTCTCCTACCCCCGA 
CAAGACTGGACAGGGTCTACCCCAGCTTATGGCTACTGGTTCAAAGCAGTGACTGAGACAAC 
CAAGGGTGCTCCTGTGGCCACAAACCACCAGAGTCGAGAGGTGGAAATGAGCACCCGGGGCC 
GATTCCAGCTCACTGGGGATCCCGCCAAGGGGAACTGCTCCTTGGTGATCAGAGACGCGCAG 
ATGCAGGATGAGTCACAGTACTTCTTTCGGGTGGAGAGAGGAAGCTATGTGACATATAATTT 
CATGAACGATGGGTTCTTTCTAAAAGTAACAGTGCTCAGCTTCACGCCCAGACCCCAGGACC 
ACAACACCGACCTCACCTGCCATGTGGACTTCTCCAGAAAGGGTGTGAGCGCACAGAGGACC 
GTCCGACTCCGTGTGGCCTATGCCCCCAGAGACCTTGTTATCAGCATTTCACGTGACAACAC 
GC C AG C C C T G GAG C C C C AG C C C C AGGGAAATG TC C C ATAC C T GGAAGC CCAAAAAGGC CAGT 

TCCTGCGGCTCCTC.TGTGCTGCTGACAGCCAGCCCCCTGCCACACTGAGCTGGGTCCTGCAG 
AACAGAGTCCTCTCCTCGTCCCATCCCTGGGGCCCTAGACCCCTGGGGCTGGAGCTGCCCGG 
GGTGAAGGCTGGGGATTCAGGGCGCTACACCTGCCGAGCGGAGAACAGGCTTGGCTCCCAGC 
AGCGAGCCCTGGACCTCTCTGTGCAGTATCCTCCAGAGAACCTGAGAGTGATGGTTTCCCAA 
GCAAACAGGACAGTCCTGGAAAACCTTGGGAACGGCACGTCTCTCCCAGTACTGGAGGGCCA 
AAGCCTGTGCCTGGTCTGTGTCACACACAGCAGCCCCCCAGCCAGGCTGAGCTGGACCCAGA 
GGGGACAGGTTCTGAGCCCCTCCCAGCCCTCAGACCCCGGGGTCCTGGAGCTGCCTCGGGTT 
CAAGTGGAGCACGAAGGAGAGTTCACCTGCCACGCTCGGCACCCACTGGGCTCCCAGCACGT 
C T C T C T C AG CCTCTCCG T GC AC TATAAGAAGGGAC TCAT C TCAAC GGC AT T C TCCAAC GGAG 

CGTTTCTGGGAATCGGCATCACGGCTCTTCTTTTCCTCTGCCTGGCCCTGATCATCATGAAG 
AT T C T AC C GAAGAGAC G GAC T CAGACAGAAAC C C C GAGGC C C AGG T T C T C CC GGCAC AGCAC 

GATCCTGGATTACATCAATGTGGTCCCGACGGCTGGCCCCCTGGCTCAGAAGCGGAATCAGA 
AAGCCACACCAAACAGTCCTCGGACCCCTCCTCCACCAGGTGCTCCCTCCCCAGAATCAAAG 
AAGAACCAGAAAAAGCAGTATCAGTTGCCCAGTTTCCCAGAACCCAAATCATCCACTCAAGC 
CCCAGAATCCCAGGAGAGCCAAGAGGAGCTCCATTATGCCACGCTCAACTTCCCAGGCGTCA 
GACCCAGGCCTGAGGCCCGGATGCCCAAGGGCACCCAGGCGGATTATGCAGAAGTCAAGTTC 
CAAIG^GGGTCTCTTAGGCTTTAGGACTGGGACTTCGGCTAGGGAGGAAGGTAGAGTAAGAG 
GTTGAAGATAACAGAGTGCAAAGTTTCCTTCTCTCCCTCTCTCTCTCTCTTTCTCTCTCTCT 
CTCTCTTTCTCTCTCTTTTAAAAAAACATCTGGCCAGGGCACAGTGGCTCACGCCTGTAATC 
CCAGCACTTTGGGAGGTTGAGGTGGGCAGATCGCCTGAGGTCGGGAGTTCGAGACCAGCCTG 
GCCAACTTGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGGCAGG 
C G C C T G T AAT C C TAG C TAG T T G GGAAGC T GAGGCAGGAGAAT CAC T T G AAC C T GGG AGAC GG 
AGG T T G C AG T GAG C C AAG AT C ACACC AT T G C AC GCCAG C C T GG GC AAC AAAGCGAGACT C CA 
TCTCAAAAAAAAAATCCTCCAA?ITGGGTTGGGTGTCTGTAATCCCAGCACTTTGGGAGGCTA 
AGGTGGGTGGATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAACCCC 
ATCTCTACAAAAAATACAAAACATAGCTGGGCTTGGTGGTGTGTGCCTGTAGTCCCAGCTGT 
CAGACATTTAAACCAGAGCAACTCCATCTGGAATAGGAGCTGAATAAAATGAGGCTGAGACC 
TACTGGGCTGCATTCTCAGACAGTGGAGGCATTCTAAGTCACAGGATGAGACAGGAGGTCCG 
TACAAGATACAGGTCATAAAGACTTTGCTGATAAAACAGATTGCAGTAAAGAAGCCAACCAA 
AT C C CAC C AAAAC CAAG T T G GC C AC GAGAG TGACC TCTGGTCGTC C TC AC TGC TACAC TCC T 

GACAGCACCATGACAGTTTACAAATGCCATGGCAACATCAGGAAGTTACCCGATATGTCCCA 
AAAGGGGGAGGAATGAATAATCCACCCCTTGTTTAGCAAATAAGCAAGAAATAACCATAAAA 

GTGGGCAACCAGCAGCTCTAGGCGCTGCTCTTGTCTATGGAGTAGCCATTCTTTTGTTCCTT 
TACTTTCTTAATAAACTTGCTTTCACCTTAAAAAAA 
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FIGURE 93 

X/usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA54002 
xsubunit 1 of 1, 544 aa, 1 stop 
><MW: 60268, pi: 9.53, NX{S/T): 3 

MLLPLLLSSLLGGSQAMDGRFWIRVQESVMVPEGLCISVPCSFSYPRQDWTGSTPAYGYW-FK 

AVTETTKGAPVATNHQSREVEMSTRGRFQLTGDPAKGNCSLVIRDAQMQDESQYFFRVERGS 

YVTYNFMNDGFFLKVTVLSFTPRPQDHNTDLTCHVDFSRKGVSAQRTVRLRVAYAPRDLVIS 

ISRDNTPALEPQPQGWPYLEAQKGQFLRLLCAADSQPPATLSWVLQNRVLSSSHPWGPRPL 

GLELPGVKAGDSGRYTCRAENRLGSQQRALDLSVQYPPENLRVMVSQANRTVLENLGNGTSL 

PVLEGQSLCLVCVTHSSPPARLSWTQRGQVLSPSQPSDPGVLELPRVQVEHEGEFTCHARHP 

LGSQHVSLSLSVHYKKGLISTAFSNGAFLGIGITALLFLCLALIIl^KILPKRRTQTETPRPR 

FSRHSTILDYI3STWPTAGPLAQKRNQKATPNSPRTPPPPGAPSPESKKNQKKQYQLPSFP 
KS S T QAPE SQES QEE LH YATLNFPGVRPRPE ARMP KGT QADYAEVKFQ 

Important features: 
Signal peptide: 
amino acids 1-15 

Transmembrane domain: 

amino acids 399-418 

N-glycosylation site. 

amino acids 100-103, 297-300 and 306-309 

Immunoglobulins and major histocompatibility complex proteins 
signature. 

amino acids 365-371 



35 j £39- 
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T G AAGAG T AATAG T T GGAAT CA&^^ 

TATGTTGGGAATTCCTCTCCTATGGCCTTGTCTTGGAGCAACAGAAAACTCTCAAACAAAGA 
AAGTCAAGCAGCCAGTGCGATCTCATTTGAGAGTGAAGCGTGGCTGGGTGTGGAACCAATTT 
T T T G T AC C AGAGGAAAT GAAT ACGAC TAG T C ATCACATCGG CC AGC T AAGAT C TGATTTAGA 

CAATGGAAACAATTCTTTCCAGTACAAGCTTTTGGGAGCTGGAGCTGGAAGTACTT.TTATCA 
T T GAT GAAAGAACAGGT GACATATATGC CATACAGAAGCT TGATAGAGAGGAGCGATCCCTC 

TACATCTTAAGAGCCCAGGTAATAGACATCGCTACTGGAAGGGCTGTGGAACCTGAGTCTGA 
GTTTGTCATCAAAGTTTCGGATATCAATGACAATGAACCAAAATTCCTAGATGAACCTTATG 
AGGCCATTGTACCAGAGATGTCTCCAGAAGGAACATTAGTTATCCAGGTGACAGCAAGTGAT 
GCTGACGATCCCTCAAGTGGTAATAATGCTCGTCTCCTCTACAGCTTACTTCAAGGCCAGCC 
ATATTTTTCTGTTGAACCAACAACAGGAGTCATAAGAATATCTTCTAAAATGGATAGAGAAC 
TGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGTCAGCCAGGAGCGTTG 
TCTGGAACAACAAGTGTATTAATT7VAACTTTCAGATGTTAATGACAATAAGCCTATATTTAA 
AGAAAG T T TAT AC C GC T TGAC TG TC T C T GAAT C TGCACCCAC T GGGAC T TC T ATAGGAACAA 

TCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTACAGCATTGAAGAGGAT 
GATTCGCAAACATTTGACATTATTACTAATCATGAAACTCAAGAAGGAATAGTTATATTAAA 
• AAAGAAAG T GGAT T T T G AGC AC C AGAAC CAC T ACG G TAT TAGAG CAAAAG T T AAAAACG&.TC 
AT G T T C C T GAGC AG C T CAT G AAG TAC CACAC T GAGGC T T CC AC CAC T TT CAT TAAGATCCAG 

GTGGAAGATGTTGATGAGCCTCCTCTTTTCCTCCTTCCATATTATGTATTTGAAGTTTTTGA 
AGAAACCCCACAGGGATCATTTGTAGGCGTGGTGTCTGCCACAG'ACCCAGACAATAGGAAAT 
CTCCTATCAGGTATTCTATTACTAGGAGCAAAGTGTTCAATATCAATGATAATGGTACAATC 
ACTACAAGTAACTCACTGGATCGTGAAATCAGTGCTTGGTACAACCTAAGTATTACAGCCAC 
AGAAAAATACAATATAGAACAGATCTCTTCGATCCCACTGTATGTGCAAGTTCTTAACATCA 
ATGATCATGCTCCTGAGTTCTCTCAATACTATGAGACTTATGTTTGTGAAAATGCAGGCTCT 
GG TC AGGTAATTCAGAC TATCAGT GCAG T GGATAGAGATGAATCCATAGAAGAGGACCATTT 
TTACTTTAATCTATCTGTAGAAGACACTAACAATTCAAGTTTTACAATCATAGATAATCAAG 
AT AAC AC AG C T G T CAT T T T G AC TAATAG AAC TGG T T TTAAC CT T CAAGAAGAACCTG TCTTC 
TAC AT C T C CAT C T T AAT T G C C GACAAT G GAATCC C G T CAC T TAC AAG T AC AAACACC C T TAC 
CAT C CAT G T C T G T G AC T G T G G T GAC AG T GGG AG CACAC AGAC C T G C C AG TAC C AG GAG C T T G 
TGCTTTCCATGGGATTCAAGACAGAAGTTATCATTGCTATTCTCATTTGCATTATGATCATA 
TTTGGGTTTATTTTTTTGACTTTGGGTTTAAAACAACGGAGAAAACAGATTCTATTTCCTGA 
GAAAAGTGAAGATTTCAGAGAGAATATATTCCAATATGATGATGAAGGGGGTGGAGAAGAAG 
ATACAGAGGCCTTTGATATAGCAGAGCTGAGGAGTAGTACCATAATGCGGGAACGCAAGACT 
CGGAAAACCACAAGCGCTGAGATCAGGAGCCTATACAGGCAGTCTTTGCAAGTTGGCCCCGA 
CAGTGCCATATTCAGGAAATTCATTCTGGAAAAGCTCGAAGAAGCTAATACTGATCCGTGTG 
CCCCTCCTTTTGATTCCCTCCAGACCTACGCTTTTGAGGGAACAGGGTCATTAGCTGGATCC 
C T GAG C T C C T T AGAAT C AG C AG TC T C T GAT CAGGATGAAAGC T AT GAT T ACC T T AATGAGTT 
GGGACC T C GC T T TAAAAG AT TAGC AT GCAT GTTTGGTT C TGCAG T GCAG T CAAATAATI&GG 
GCTTTTTACCATCAAAATTTTTAAAAGTGCTAATGTGTATTCGAACCCAATGGTAGTCTTAA 
AGAGTTTTGTGCCCTGGCTCTATGGCGGGGAAAGCCCTAGTCTATGGAGTTTTCTGATTTCC 
C T GGAG TAAATAC T C C ATG G T TAT T T TAAGC TACCTACAT G C T G T CAT T GAAC AGAGATGTG 

GGGAGAAATGTAAACAATCAGCTCACAGGCATCAATACAACCAGATTTGAAGTAAAATAATG 
TAGGAAGATATTAAAAGTAGATGAGAGGACACAAGATGTAGTCGATCCTTATGCGATTATAT 
CATTAT T TAC T TAGGAAAGAG TAAAAATACC AAACGAGAAAAT T TAAAGGAGCAAAAATTTG 
CAAGTCAAATAGAAATGTACAAATCGAGATAACATTTACATTTCTATCATATTGACATGAAA 
AT T GAAAATG T AT AG T C AG AGAA&T T T T CAT GAAT TAT T CC AT GAAG TAT T G T T TC C T TTAT 
TTAAA 
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FIGURE 95 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA53906 
Xsubunit 1 of 1, 772 aa, 1 stop 
XMW: 87002, pi: 4.64, NX(S/T): 8 

MNC YLLLRFMLG I PLLWPCLGATENSQTKKVKQPVRSHLRVKRGWVWNQFFVPEEMNTTSHH 
IGQLRSDLDNGNNSFQYKLLGAGAGSTFIIDERTGDIYAIQKLDREERSLYILRAQVIDIAT 
GRAVEPESEFVIKVSDINDNEPKFLDEPYEAIVPEMSPEGTLVIQVTASDADDPSSGNNARL 
LYSLLQGQPYFSVEPTTGVIRISSKMDRELQDEYWVIIQAKDMIGQPGALSGTTSVLIKLSD 
VNDNKPIFKESLYRLTVSESAPTGTSIGTIMAYDNDIGENAEMDYSIEEDDSQTFDIITNHE 
TQEGIVILKKKVDFEHQNHYGIRAKVlCNHHVPEQmKYHTEASTTFIKIQVEDVDEPPLFLL 
P YYVFE VFEE T PQGS FVG WSATDPDNRKS P I RYS I TRSKVTOINDNG T I TTSNS LDRE I SA 
WYNLS I TATEKYN I E Q I S S I PL YVQVLN I NDHAPE FSQYYETYVCENAGSGQVI QT I SAVDR 
DESIEEHHFYFNLSVEDTNNSSFTIIDNQDNTAVILTNRTGFNLQEEPVFYISILIADNGIP 
SLTSTNTLTIHVCDCGDSGSTQTCQYQELVLSMGFKTEVXIAILICIMIIFGFIFLTLGLKQ 
RRKQILFPEKSEDFRENIFQYDDEGGGEEDTEAFDIAELRSSTIMRERKTRKTTSAEIRSLY 

RQSLQVGPDSAIFRKFILEKLEEANTDPCAPPFDSLQTYAFEGTGSLAGSLSSLESAVSDQD 
ESYDYLNE.LGPRFKRLACMFGSAVQSNN 



Important features : 
Signal peptide: 

amino acids 1-21 



Transmembrane domain: 
amino acids 597-617 



N-glycosylation sites. 

amino acids 57-60, 74-77, 419-423, 437-440, 508-511, 515-518, 
516-519 and 534-537 



Cadherins extracellular repeated domain signature, 
amino acids 136-146 and 244-254 
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ATTTCAAGGCCAGCCATATTTTTN7GTTGAACCAACAACAGGAGTCATAAGAATATTTTNTA 
AAATGGATAGAGAACTGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGT 
CAGCCAGGAGCGTTGTNTGGAACAACAAGTGTATTAATTAAACTTTCAGATGTTAATGACAA 
TAAGCCTATATTTAAAGAAAGTTTATACCGCTTGACTGTNTNTGAATCTGCACCCACTGGGA 
NTTNTATAGGAACAATCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTAC 
AGCATTGAAGAGGATGATTCGCAAACATTTGACATTATT 
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FIGURE 97 

GCAACCTCAGCTTCTAGTATCCAGACTCCAGCGCCGCCCCGGGCGCGGACCCCAACCCCGAC 
CCAGAGCTTCTCCAGCGGCGGCGCAGCGAGCAGGGCTCCCCGCCTTAACTTCCTCCGCGGGG 
CCCAGCCACCTTCGGGAGTCCGGGTTGCCCACCTGCAAACTCTCCGCCTTCTGCACCT'GCCA 
CCCCTGAGCCAGCGCGGGCCCCCGAGCGAGTCAISGCCAACGCGGGGCTGCAGCTGTTGGGC 
TTCATTCTCGCCTTCCTGGGATGGATCGGCGCCATCGTCAGCACTGCCCTGCCCCAGTGGAG 
GATTTACTCCTATGCCGGCGACAACATCGTGACCGCCCAGGCCATGTACGAGGGGCTGTGGA 
TGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAAT 
CTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGAT 
AGCAAT CTTTGTGGC C AC C G T T GGCAT GAAG TG TATGAAGT GC TT GGAAGACGATGAGGTGC 

AGAAGATGAGGATGGCTGTCATTGGGGGTGCGATATTTCTTCTTGCAGGTCTGGCTATTTTA 
GTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCTATGACCCCAGT 
CAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCC 
TTCTGGGAGGTGCCCTACTTTGCTGTTCCTGTCCCCGAAAAACAACCTCTTACCCAACACCA 
AGGCCCTATCCAAAACCTGCACCTTCCAGCGGGAAAGACTACGTG^G^CACAGAGGCAAAAG 
G AG AAAAT CAT G T T GAAACAAAC C G AAAAT GGACAT T GAGATAC TAT CAT TAAC AT TAGGAC 
CTTAGAAT T T T GGGTAT T G TAATCTGAAG TATGGTATTACAAAACAAACAAACAAACAAAAA 
ACCCATGTGTTAAAATACTCAGTGCTAAACATGGCTTAATCTTATTTTATCTTCTTTCCTCA 
AT AT AGG AG G GAAG AT T T T T C CAT T TG TAT T AC TGC TT C C CAT T GAG TAAT C ATAC T CAAAT 
GGG GG AAGG G GTGCTCCT T AAATATAT ATAGATATG T ATAT AT AC AT GT T TT TC TAT TAAAA 
AT AGAC AG T AAAAT AC TAT T C TCAT TAT G TT GAT AC T AGCATAC T TAAAAT AT C T C TAAAAT 
AGGTAAATGTATTTAATTCCATATTGATGAAGATGTTTATTGGTATATTTTCTTTTTCGTCC 
TTATATACATATGTAACAGTCAAATATCATTTACTCTTCTTCATTAGCTTTGGGTGCCTTTG 
CCACAAGACCTAGCCTAATTTACCAAGGATGAATTCTTTCAATTCTTCATGCGTGCCCTTTT 
CAT AT AC T TAT T T TAT T T T T T AC C AT AAT C T TAT AG C AC T T G CAT C G T TAT T AAG C C C T TAT 

TTGTTTTGTGTTTCATTGGTCTCTATCTCCTGAATCTAACACATTTCATAGCCTACATTTTA 
G T T T C T AAAG C CAAGAAGAAT T TAT T ACAAAT CAGAAC T T T GGAGGCAAATC T T T C T GCATG 

ACCAAAGTGATAAATTCCTGTTGACCTTCCCACACAATCCCTGTACTCTGACCCATAGCACT 
CTTGTTTGCTTTGAAAATATTTGTCCAATTGAGTAGCTGCATGCTGTTCCCCCAGGTGTTGT 
AAC AC AAC T T TAT T GAT T G AAT T T T T AAGC TAC T TAT T CAT AG T T T TAT AT C CCC C T AAAC T 
ACCTTTTTGTTCCCCATTCCTTAATTGTATTGTTTTCCCAAGTGTAATTATCATGCGTTTTA 
TATCTTCCTAATAAGGTGTGGTCTGTTTGTCTGAACAAAGTGCTAGACTTTCTGGAGTGATA 
ATCTGGTGACAAATATTCTCTCTGTAGCTGTAAGCAAGTCACTTAATCTTTCTACCTCTTTT 
T T C TAT C T G C CAAAT T G AGATAATG ATAC T TAAC C AGT TAGAAGAGG T AG T G T GAATAT TAA 
TTAGTTTATATTACTCTTATTCTTTGAACATGAACTATGCCTATGTAGTGTCTTTATTTGCT 
CAGCTGGCTGAGACACTGAAGAAGTCACTGAACAAAACCTACACACGTACCTTCATGTGATT 
CAC TGCCTTCCTCTCTC TAC C AGTC TAT T TC CACT GAACAAAAC C T ACAC AC AT ACC T T CAT 
G T GG T T C AG TGCCTTCCTCTCTC TAC C AG TC TAT T T CC ACT GAACAAAAC CT ACGC ACATAC 
CTTCATGTGGCTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCATTCTTTCAGCTGTGTCT 
GACATGTTTGTGCTCTGTTCCATTTTAACAACTGCTCTTACTTTTCCAGTCTGTACAGAATG 
CTATTTCACTTGAGCAAGATGATGTAATGGAAAGGGTGTTGGCACTGGTGTCTGGAGACCTG 
GATTTGAGTCTTGGTGCTATCAATCACCGTCTGTGTTTGAGCAAGGCATTTGGCTGCTGTAA 

GCTTATTGCTTCATCTGTAAGCGGTGGTTTGTAATTCCTGATCTTCCCACCTCACAGTGATG 
TTGTGGGGATCCAGTGAGATAGAATACATGTAAGTGTGGTTTTGTAATTTAAAAAGTGCTAT 

ACTAAGGGAAAGAATTGAGGAATTAACTGCATACGTTTTGGTGTTGCTTTTCAAATGTTTGA 
AAATAAAAAAAATGTTAAG 
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FIGURE 98 



X/usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA52185 
xsubunit 1 of 1, 211 aa, 1 stop 
><MW: 22744, pi; 8.51, NX(S/T): 1 
MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYA^ 

QCKVFDS LLNLS S TLQATRALMWGILLGVIAI FVATVGMKCMKCLEDDEVQKMRMAVIGGA 

IFLIAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGALLCCSC 
PRKTTSYPTPRPYPKPAPSSGKDYV 

Important features: 
Signal peptide : 

amino acids 1-21 

Transmembrane domains: 

amino acids 82-102, 118-142 and 161-187 

N-glycosylation site, 

amino acids 72-75 

PMP-22 / EMP / MP20 family proteins 

amino acids 7 0-111 

ABC-2 type transport system integral membrane protein 

amino acids 119-133 
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FIGURE 99 



TTCTGGCCAAACCCGGGGCTNCAGCTGTTGGGCTTCATCTCGCCTTCCTGGGATGGATCGGC 
GCCATCNTCACACTGCCCTTCCCCAGTGGAGGATTTTACTCCCTATGCTGGCGACAACATCG 
TGACCGCCCAGCCCATGTACGAGGGGCTGTGGATGTCCNGCGTGTCGCAGAGCACCGGGCAG 
ATCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAGCAGCACATTGCAAGCAACCCGTGC 
CTTGATGGTGGTTGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGA 
AGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGC 
G C G ATAT TTCTTCTTG C AGG T C T GG C TAT T T TAGT TGCCAC AG CAT G G TAT G GCAAT AGAAN 
CNTTCAACANTTCTAT GACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCA 
GGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCT 
GTTCCTGTCCC 
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FIGURE 100 



ACCCTTGACCCAACGCGGCCCCCCGACCGNTTCATGGCCAAACGCGGGNCTCCAGCTGTrGG 
GCTTCATTCTCCCCTTCCTGGGATGGACCGGCGCCCATCNTCAGCACTGCCCTGCCCCAGTG 
GAGGATTTACTCCTATNCCGGCNACAACATCGTGACCGCCCAGGCCNTGTACGAGGGGCTGT 
GGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCCTTGCT 
GAATCTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAG 
TGATAGCAATCTTNNTGGCCACCGTTGTNNNTGAAGTGTATGAAGTGCTTGGAAGACGATGA 
GGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTA 
TTT TAGT TGCCACAGCATGGTATGGCAATAGAAT CGTTCAAGAAT T C TAT GACCC T AT GACCGA 
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FIGURE 101 

GGGCCCGACCATTATCCAACCGGGNTCACTGTTGGCTCATCTCCCTCCTGGATGAANCGCGC 
CAT CN T C AG AC TCCCTGCCC CAT GGAGAT T TNNCC TAT G C T GG CGAC AAC ATCNT GAC CC C C 
AG C CAT G TAG G AGGG G C T T T GAACG T CNG C G TG TC G C AGANCACC GGGC AGATCCAG T GCAA 
AGTCTTTGACTCCTTGCTGAATCTGNGCAGCACATTGCAGCAACCCNTGCCCTGATGGTGGT 
TGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGT 
G C T T G GAAG AC GAT GAG G T G C AGAAG ATGAGGAT GG C T G T CAT T G G GGG C GC GATAT T TCT T 
CTTGCAGGTCTGGCTATTTNNNGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAAT 
TCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGC 
TGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGA 



WO 99/46281 



PCT/US99/05028 



FIGURE 102 

ATTCTCCCCTCCTGGATGGATCGCNCCACCGTCACATTGCCTTCCCCCANTGGAGGATTNAC 
TCCTATGCTGGCGACAACATCGTGACCCCCCAGGCCATTTACCGAGGGGCTTTGGATGTCNT 
GCNTGTCGCAGAGCACCGGGCAGATCCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAG 
CAGCACATTGCAAGCAACCCGTGCCTTGATGGGGTTGGCATCCTCCTGGGAGTGATAGCAAC 
CTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCCAGAAG 
ATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTGTTGCAGGTCTGGCTATTTTAGTNGC 
C ACAG CAT G G TAT G G C AAT AGANTNNT T CNNGNNNT C TAT GAC C C TAT GAC C C C AG T C AATG 

CCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTG 
GGAGGTGCCCTACTTTGCTGTTCCTGTCCC 
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F IGURE 103 

AGAG C AC C G G C AGAT C C C AG TNC AAAG T C T T T GACCCT T G C T GAAT C T GAG CAG C AC AT TNC 

AAGCAACCCCTTGCCTTGAAGGTGGTTGNCATCCCCCCTGGGAGTGAATAGCAATCTTTGTG 
G CCAC C G T T G G CAT GAAG TNT ATGAAG TGC T T GGAAG AC GAT GAGG TGC AGAAG AT GAGGAT 

GGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTNNCCACAGCAT 

GGTATGGCAATAGNATNNTTCGNGGNTTCTATGACCCTATGACCCCAGTCAATGCCAGGTAC 

GAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGC 
CCTACTTTGCTGTTCCTGTCCCCGAA 
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FI GURE 1Q4 

AGCAATGCCCTGCCCCCAGTGGAGGATTAATTCCTATGNTGGGGACAACATTGTGACNGCCC 
AGGCCATGTACGGGGGGCTGTGGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGC 
A7\AGTNTTTGACTCCTTGCTGAATTTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGT 
GGTTGGCATCTTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTGGNAATGAAGTGTATGA 
AGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTT 
C T TNT T G C AGG T C T G G C TAT T T TAG T T G CC ACAG C AT GG TAT G G CAAT AGAATNG T T CAAGA 

ATTTTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTTTNTTCACTG 
GCTGGGCTGCTGCTTNTTTCTGCCTTNTGGGAGGTGCCCTANTTTGCTGTTCCTGCGAACC 
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FIGURE 105 

TCATAGGGGGGCGCGATATTTTTTCTTGCAGGTNTGGTTATTTTAGTTGCCACAGCATGGTA 

TGGCAATAGAATCGTTCAAGAATTNTATGACCCTATGACCCCAGTCAATGCCAGGTACGAAT 

TTGGTCAGGCTCTNTTCACTGGNTGGGCTGCTGCTTCTNTNNGCCTTNTGGGAGGTGCCCTA 
CTTTGCTGTTCCTG 
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FIGURE 106 



TTCCTGGGATGGATCCGCCCCCATCNTCACATGCCCTGCCCCNTGGAGATTTACNCCTATGC 

TGGCGAACAACATCNTGACCGCCCAGGCCATGTACGAGGGGCTGTGGAATGTCCTGCGTGTC 

CCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAGCAGCACAT 

TGCAAGCAACCNTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGG 

CCACCGTTGGCATGAAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGAT 

GGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGNNGCCACAGCAT 

GGTATGGCAATCAGACCCNNTCANAAACTCTATGACCCTATGACCCCAGTCAATGCCAGGTA 

CGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTG 
CCCTACTTTGCTGTTCCTGTCCCCGAAAAACAACCTCTTACCCACG 
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FIGURE 107 



CGGGGCTGCAGCTGTTGGGCTTCATCTCGCTTCCTGGGATGGAATCGGCGCCATCGTCAGCA 

CTGCCCTGCCCCATGGAGGATTTACTCNTATGCTGGCGACAACATCGTGACCNCCCAGGCCA 

TGTACGAGGGGCTGTGGATGTCNGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCT 

TTGACTCCTTGCTGAATCTGAGCAGCACATTGCAAGCAACCNTGCCTTGATGGTGGTTGGCA 

TCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTG 

GAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGC 
AGG TCTGGCTATT TNT AG T T G C C AC AGC ATGG T AT G GC AAT AGAAT CGT T CAAGAAT T C TAT 

GACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGC 
TGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGAA 
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FIGURE 108 

GCGTGCCGTCAGCTCGCCGGGCACCGCGGCCTCGCCCTCGCCCTCCGCCCCTGCGCCTGCAC 
CGCGTAGACCGACCCCCCCCTCCAGCGCGCCCACCCGGTAGAGGACCCCCGCCCGTGCCCCG 
ACCGGTCCCCGCCTTTTTGTAAAACTTAAAGCGGGCGCAGCATTAACGCTTCCCGCCCCGGT 
GACCTCTCAGGGGTCTCCCCGCCAAAGGTGCTCCGCCGCTAAGGAACAT5GCGAAGGTGGAG 
CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAGGTCCCTTCACCGATGTTGT 
C A C C AC C AA C C T AAAG C T T G G C AAC C C G AC AG AC C G AAAT GTGTGTTT T AAG G T GAAG AC T A 

CAGCACCACGTAGGTACTGTGTGAGGCCCAACAGCGGAATCATCGATGCAGGGGCCTCAATT 
AATGTATCTGTGATGTTACAGCCTTTCGATTATGATCCCAATGAGAAAAGTAAACACAAGTT 
TAT G G T T C AG T C TAT GTTTGCTC CAAC T GAC AC T T C AGAT AT G GAAGC AG T AT GGAAGGAGG 
CAAAAC C G GAAG AC C T TAT G GAT T CAAAAC T T AGATG T G T G T T T GAAT T G C C AG C AGAGAAT 
G AT AAAC C AC AT GAT G TAG AAAT AAAT AAAAT TAT AT C C ACAAC T G CAT C AAAGAC AGAAAC 
AC C AAT AG T G T C T AAG T C T C T GAG TTCTTCTT TGGAT GAC AC C GAAG T T AAGAAGG T TATGG 
AAGAAT G T AAG AG G C T G CAAGG T GAAG T T C AGAG G C T AC GGGAG GAGAACAAGCAGT T CAAG 
GAAGAAGAT GG AC T G C G GAT G AGG AAGAC AG T GCAGAGCAAC AGC C C CAT T T CAG CAT TAGC 
CCCAACTGGGAAGGAAGAAGGCCTTAGCACCCGGCTCTTGGCTCTGGTGGTTTTGTTCTTTA 
TCGTTGGTGTAATTATTGGGAAGATTGCCTTG2ASAGGTAGCATGCACAGGATGGTAAATTG 
GAT TGGTGGATC C AC CAT AT CAT GGGAT T TAAAT T TAT C ATAAC CAT G T G TAAAAAGAAAT T 
AAT G TAT GAT GAC AT C T C AC AGG TCTTGCCTT T AAAT T AC C C C T C C C T GCAC AC AC AT AC AC 
AG AT AC AC AC AC AC AAAT AT AAT G TAAC GAT C T T T TAG AAAG T T AAAAAT G T ATAG TAAC T G 
AT T GAGG G G GAAAAAG AAT GAT C T T TAT TAAT GAC AAG GGAAAC CAT GAGTAAT G C CAC AAT 
GGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGCTGGATTACCTC 
TCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTGGAGCCCAGCAT 
GCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTG 
CTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTGATGAACAGAGTCAGA 
AG C C C AAAGG AAT T GCAC T G T GG CAGCAT CAGAC G TAC T C G T CAT AAG T GAGAG GC G T G T G T 

TGACTGATTGACCCAGCGCTTTGGAAA.TAAA.TGGCAGTGCTTTGTTCACTTAAAGGGACCAA 
G C TAAAT T T G TAT T G G T T C AT G TAG T GAAG T C AAAC T GT TAT T CAGAGATG T T TAAT GC ATA 

TTTAACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGC 
TGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTGTGGGCTCCTCT 
GTCTCTGGAGAGTCTGGTCATGTGGAGGTGGGGTTTATTGGGATGCTGGAGAAGAGCTGCCA 
GGAAGTGTTTTTTCTGGGTCAGTAAATAACAACTGTCATAGGGAGGGAAATTCTCAGTAGTG 
ACAGTCAACTCTAGGTTACCTTTTTTAATGAAGAGTAGTCAGTCTTCTAGATTGTTCTTATA 
CCACCTCTCAACCATTACTCACACTTCCAGCGCCCAGGTCCAAGTCTGAGCCTGACCTCCCC 
TTGGGGACCTAGCCTGGAGTCAGGACAAATGGATCGGGCTGCAGAGGGTTAGAAGCGAGGGC 
ACCAGCAGTTGTGGGTGGGGAGCAAGGGAAGAGAGAAACTCTTCAGCGAATCCTTCTAGTAC 
TAG T T GAGAG T T T GAC T G T GAAT TAAT T T T AT GCCATAAAAG AC CAAC C CAG TTCTGTTTGA 
C TAT G TAGC AT C T TGAAAAGAAAAAT TATAAT AAAG CCC CAAAAT TAAGAAAA 
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FIGURE 109 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA53977 
<subunit 1 of 1, 243 aa, 1 stop 
<MW: 27228, pi: 7.43, NX(S/T): 2 

MAKVEQVLSLEPQHELKFRGPFTDWTTNLKLGNPTDRNVCFKVKTTAPRRYCmPNSGII^ 
AGASINVSVMLQPFDYDPNEKSKHKFMVQSMF^ 

LPAENDKPHDVEINKI ISTTASKTETPIVSKSLSSSLDDTEVKKVMEECKRLQGEVQRLREE 
NKQFKEEDGLRMRKTVQSNSPISAIAPTGKEEGLSTRLLALVVXFFIVGVIIGKIA^ 

Important features: 

Putative transmembrane domain: 

amino acids 224-239 

N-glycosylation site. 

amino acids 68-71 

N-myristoylation site, 

amino acids 59-64', 64-69 and 235-24 0 
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FIGURE 110 



GTCAGTCTTCTAGATTGTCCTTATCCCACCTTTCAACCANTACTCACATTTCNAGCGCCCAG 

GTCCANGTCTGAGCCTGACTTCCCCTTGGGGACCTAGCCTGGAGTCAGGACAATGGNTCGGG 

CTGCAGAGGNTTAGAAGCGAGGGCACCAGCAGTTTTGGGTGGGGAGCAAGGGNNGAGAGAAA 

CTCTTCAGCGAATCCTTCTAGTACTAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATA 

AAAGACNAACCCAGTTCTGTTTGACTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCC 

CCAAAATTAAGAATTCTTTTGTCATTTTGTCACATTTGCTCTATGGGGGGAATTATTATTTT 
ATCATTTTTATTATTTTGCCATTGGAAGGTTAACTTTAAAATGAGC 
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FIGU R E 111 

TATTGTAAAGGCCATTTTAAACCATTGGTAGGCCTTGGTACATGATGCTGGATTACCTCCTT 
AAATGACACCNTTCCTCGCCTGTTGGTGCTGGCCNTTGGGGAGCTGGAGCCCCAGCATGCTG 
GGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTGCTTT 
CCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTGATGAACAGAGTCAGAAGCC 
CAAAGGAATTGCCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGTTGA 
CTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAAGCT 
AAATTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATATTTA 

ACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGCTGCG 
TGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTG 
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FIGURE 112 



CCCTGGTGGTTTTGTTCTTTAATTCGTTGGTGTAATTNTTGGGAAGATTGCTTGTAGAGGTA 
GNAT GC AC CNGG C T G G T AAAT T GGAT TGGTG GAT CCACC ATAT C C ATGG GAT T T AAAT T TAT 
CAT AAC CAT GTG TAAAAAGAAATTAAT GTATGATGACATNT CACAGGTATTGCC TT TAAATT 
AC C CAT C C C T GNAN AC AC AT ACAC AG AT AC AC ANANACAAATNTAATG T AACG ATJSTT T T TAG 
AAAG T T AAAAAT G TATAG TAAC 
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FIGURE 113 



GGTGGCCCATTCCCGGCCCAGGCTGCTTTCCGGTNTTCAGTTCTGTCCAAGCCATCAGCTC^ 
T TGGGAC T GAT GAACAGAG T CAGAAGCCCAAAGGAATTGCAC T GTGGC AGCATNAGACGTAC 
TTGTNATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGT 
GCTTTGTTCANTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTG 
T TAT T C AGAG AT G T T T AAT GC AT AT TT AANT TAT T TAATG T AT T TNATNT CATG T TT TCTT A 
TTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAANTNTGTTGGGTGAACTGGTATTGC 
TGCTGGAGGGCTGTGGGCTCCTCTGTCTTTGGAGAGTCTGGTCATGTGGAGGTGGG 
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FIGURE 114 

TGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTTGATGAACAGAGTC 
AGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTG 
TGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGAC 
CAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGC 
AT AT T T AAC T TAT T T AAT G TAT T T CAT C TC AT G T T T T C T TAT T G T CACAAGAG TAC AG T TAA 
TGCTGCGTGC 
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FIGURE 115 



PCT/US99/05028 



AAAC C T T T AAAAG T T GAG G G GAAAAGAAT GAT C C T TT ATTAAT GACAAG GGAAAC CNTGNGT 
AAT GCC AC AAT G G CAT AT T G T AAAT G T CAT T T T AAAC AT T G G T AGGC C T T GG T AC AT GAT G C 
TGGATTACCTCTCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTN 
GAGCCCAGCATGCTGGGGAGTGCGGTCTGCTCCACACAGTAGTCCCCANGTGGCCCANTCCC 
GGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGANTGATGA 
ACAGAG T C AGAAG C CCAAAGGAATTGCANTG TGGCAGCAT CAGANG TANTNG TCATAAGTGA 
GAGGCGTGTGTTGANTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCANTT 
7LAAGGGNCCAAGNTAAATTTGTATTGGTTCATGTAGTGAAGTCAAANTGTTATTCAGAGATG 
TTTAATGCATATTTAANTTATTTAATGTATTTCATNTCATGTTTTCTTATTGTCACAAGGGT 
ACAGTTAATGCTGCGTGCTGCTGAANTCTGTTGGGTGAANfTGGTATTGCTG 
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FIGURE 116 

GGCCCTTGGGGAGCTGGAGCCCAGCATGCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCC 
CACGTGGCCCACTCCCGGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGC 
T C C T T G GGAC T GAT G AAC AG AG T CAGAAGCCCAAAGGAAT T GC AC T G T GGCAGC AT G AGACG 
TACTCGTCATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGC 
AGTGCTTTGTTCACTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAA 
CTGTTATTCAGAGATGTTTAATGCATATTTAACTTATTTAATGTATTTCATCTCATGTTTTC 
TTATTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTAT 
TGCTGCTGGAGGGCTGTGGGCTCCTCTGTCTCTGGAGAGTCTGGTCATGTGGAGGTGGG 



Mil 233- 
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FIGUR E 117 

GCGAGCTCCGGGTGCTGTGGCCCGGCCTTGGCGGGGCGGCCTCCGGCTCAGGCTGGCTGAGA 
GGCTCCCAGCTGCAGCGTCCCCGCCCGCCTCCTCGGGAGCTCTGATCTCAGCTGACAGTGCC 
CTCGGGGACCAAACAAGCCTGGCAGGGTCTCACTTTGTTGCCCAGGCTGGAGTTCAGTGCCA 
TGATCATGGTTTACTGCAGCCTTGACCTCCTGGGTTCAAGCGATCCTGCTGAGTAGCTGGGA 
C T AC AG G ACAAAAT T AG AAGAT C AAAATSGAAAAT AT GCTGCTTTGGTT GATAT T T T T C AC C 

CCTGGGTGGACCCTCATTGATGGATCTGAAATGGAATGGGATTTTATGTGGCACTTGAGAAA 
GGTACCCCGGATTGTCAGTGAAAGGACTTTCCATCTCACCAGCCCCGCATTTGAGGCAGATG 
CTAAGATGATGGTAAATACAGTGTGTGGCATCGAATGCCAGAAAGAACTCCCAACTCCCAGC 
CTTTCTGAATTGGAGGATTATCTTTCCTATGAGACTGTCTTTGAGAATGGCACCCGAACCTT 
AAC C AG G G T GAAAG T T C AAGAT TTGGTTCTT GAG CCG AC T C AAAAT AT CACCACAAAGGGAG 
TAT C T G T TAG G AGAAAG AG AC AG G T G TAT G G C AC C GAC AG CAGG T T C AG CAT C T T G G ACAAA 
AGGTTCTTAACCAATTTCCCTTTCAGCACAGCTGTGAAGCTTTCCACGGGCTGTAGTGGCAT 
TCTCATTTCCCCTCAGCATGTTCTAACTGCTGCCCACTGTGTTCATGATGGAAAGGACTATG 
T C AAAG G GAG T AAAAAG C T&AG G G TAGG G T T G T T GAAG AT GAGGAAT AAAAG TGGAG GCAAG 
AAAC G T C GAG G T T C T AAG AG GAGC AGGAG AGAAG C TAG T G G T GG T GAC C AAAGAGAGG G T AC 

CAGAGAGCATCTGCAGGAGAGAGCGAAGGGTGGGAGAAGAAGAAAAAAATCTGGCCGGGGTC 
AGAG GAT T G C C GAAG G GAG GCCTTCCTT TCAG T GG AC C C GGG T C AAGAAT AC C CAC AT T CCG 

aagggctgggcacgaggAggcatgggggacgctaccttggactatgactatgctcttctgga 

G C T GAAG C G T G C T C AC AAAAAGAAAT AC AT GGAAC T T GG AAT CAG C C C AAC GAT C AAGAAAA 
TGCCTGGTGGAATGATCCACTTCTCAGGATTTGATAACGATAGGGCTGATCAGTTGGTCTAT 
CGGTTTTGCAGTGTGTCCGACGAATCCAATGATCTCCTTTACCAATACTGCGATGCTGAGTC 
GGGCTCCACCGGTTCGGGGGTCTATCTGCGTCTGAAAGATCCAGACAAAAAGAATTGGAAGC 
GCAAAATCATTGCGGTCTACTCAGGGCACCAGTGGGTGGATGTCCACGGGGTTCAGAAGGAC 
TACAACGTTGCTGTTCGCATCACTCCCCT71AAATACGCCCAGATTTGCCTCTGGATTCACGG 
GAAC GAT G C CAAT T G T G C T T AC GG CJAAC AGAGACC T GAAAC AGG GC G G T G TATCAT C TAAA 
T C AC AG AGAAAAC CAG C T C T GC T T AC C G TAG T GAGAT CAC T T CATAGG T T ATGC C T GGACT T 

GAACTCTGTCAATAGCATTTCAACATTTTTCAAAATCAGGAGATTTTCGTCCATTTAAAAAA 
T G TAT AG G T G CAG AT AT T GAAAC TAG G TGGGC AC T T CAAT GC C AAG TAT AT AC T C T T C T T TA 

CATGGTGATGAGTTTCATTTGTAGAAAAATTTTGTTGCCTTCTTAAAAATTAGACACACTTT 
AAAC C T T C AAAC AG G TAT T AT AAATAAC AT G T GAC T C C T TAAT GG AC T TAT T C T C AGG G TCC 

TACTCTAAGAAGAATCTAATAGGATGCTGGTTGTGTATTAAATGTGAAATTGCATAGATAAA 
G G TAG AT G G T AAAGCAAT TAG TAT C AGAATAGAGACAGAAAG T T ACAACAC AG T T T G TAC TA 

CTCTGAGATGGATCCATTCAGCTCATGCCCTCAATGTTTATATTGTGTTATCTGTTGGGTCT 
GGGACATTTAGTTTAGTTTTTTTGAAGAATTACAAATCAGAAGAAAAAGCAAGCATTATAAA 
CAAAACTAATAACTGTTTTACTGCTTTAAGAAATAACAATTACAATGTGTATTATTTAAAAA 
T GG GAGAAAT AG T T T G T T C T ATGAAAT AAAC C TAG TT T AGAAAT AG GGAAG C T GAGAC ATT T 
T AAGAT C T CAAG T T T T TAT T T AAC TAAT AC TCAAAATAT GGAC T T T T CAT G T ATGC ATAGGG 
AAGACAC T T C AC AAAT TAT G AAT GAT C ATG T G TTGAAAG CCACAT TAT T T TATG C TATACAT 
TCTATGTATGAGGTGCTACATTTTTAGGACAAAGAATTCTGTAATCTTTTTCAAGAAAGAGT 
CTTTTTCTCCTTGACAAAATCCAGCTTTTGTATGAGGACTATAGGGTGAATTCTCTGATTAG 
TAATTTTAGATATGTCCTTTCCTAAAAATGAATAAAATTTATGAATATGA 
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FIGURE 118 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA57253 
<subunit 1 of 1, 413 aa, 1 stop 
<MW: 47070, pi: 9.92; NX(S/T): 3 
MENMLLWLIFFTPGWTLIDGSEMEWDFT^ 

GIECQKELPTPSLSELEDYLSYETVFENGTRTLTRVKVQDLVLEPTQNITTKGVSVRRKRQV 

YGTDSRFSILDKRFLTNFPFSTAVKLSTGCSGILISPQHVLTAAHCVHDGKDYVKGSKKLRV 

GLLKMRNKSGGKKRRGSKRSRREASGGDQREGTREHLQERAKGGRRRKKSGRGQRIAEGRPS 

FQWTRVKNTHIPKGWARGGMGDATLDYDYALLELKRAHKKKYMELGISPTIKKMPGG 

GFDNDRADQLVYRFCSVSDESNDLLYQYCDAESGSTGSGVYLRLKDPDKKNWKRKIIAVYSG 
HQWVDVHG VQKD YNVAVR IT? LKYAQ ICLWI HGNDANC AYG 

Important features : 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 90-93, 110-113 and 193-196 

Glycosaminoglycan attachment site, 
amino acids 236-239 

Serine proteases, trypsin family, histidine active site, 
amino acids 165-17 0 




WO 99/46281 PCT/US99/0S028 

FIGURE 119 

i 

AAT G T GAG AG G G G C T GAT G G AAG C T G ATAGG C AGGAC T GGAG T G T TAG C AC CAG TAC TGGAT 
G T G A C A G C AG G C AG AG GAG C AC T TAG C AG C T T A T T C AG T G T C C GAT T C T GAT TCCGG C AAG G 
ATCCAAGCA^CiGAATGCTGCCGTCGGGCAACTCCTGGCACACTGCTCCTCTTTCTGGCTTTC 
CTGCTCCTGAGTTCCAGGACCGCACGCTCCGAGGAGGACCGGGACGGCCTATGGGATGCCTG 
GGGCCCATGGAGTGAATGCTCACGCACCTGCGGGGGAGGGGCCTCCTACTCTCTGAGGCGCT 

GCCTGAGCAGCAAGAGCTGTGAAGGAAGAAATATCCGATACAGAACATGCAGTAATGTGGAC 
T G C C C AC CAG AAG CAG G T GAT T TC CGAG C T CAGC AAT G C T CAG C T C AT AATGAT G T C AAGCA 

CCATGGCCAGTTTTATGAATGGCTTCCTGTGTCTAATGACCCTGACAACCCATGTTCACTCA 
AG T G C C AAG C C AAAGG AAC AAC CCTGGTTGTT GAAC T AGCAC C T AAG G T C T T AGAT GG T ACG 

CGTTGCTATACAGAATCTTTGGATATGTGCATCAGTGGTTTATGCCAAATTGTTGGCTGCGA 
TCACCAGCTGGGAAGCACCGTCAAGGAAGATAACTGTGGGGTCTGCAACGGAGATGGGTCCA 
CCTGCCGGCTGGTCCGAGGGCAGTATAAATCCCAGCTCTCCGCAACCAAATCGGATGATACT 

GTGGTTGCACTTCCCTATGGAAGTAGACATATTCGCCTTGTCTTAAAAGGTCCTGATCACTT 
AT AT C T G G AAAC CAAAAC C C T C C AG GGG AC T AAAGG T G AAAAC AG T C T CAG C T C C ACAGGAA 
CTTTCCTTGTG G AC AAT T C TAG T G T GGAC T T C CAG AAAT T T C C AG AC AAAGAGAT AC T GAGA 
ATGGCTGGACCACTCACAGCAGATTTCATTGTCAAGATTCGTAACTCGGGCTCCGCTGACAG 
TACAGTCCAGTTCATCTTCTATCAACCCATCATCCACCGATGGAGGGAGACGGATTTCTTTC 
CTTGCTCAGCAACCTGTGGAGGAGGTTATCAGCTGACATCGGCTGAGTGCTACGATCTGAGG 
AG C AAC CGTGTGGTTGCT G AC C AAT AC T G T C AC T ATT AC C C AGAG AAC AT C AAAC C C AAAC C 

CAAGCTTCAGGAGTGCAACTTGGATCCTTGTCCAGCCAGTGACGGATACAAGCAGATCATGC 
CTTATGACCTCTACCATCCCCTTCCTCGGTGGGAGGCCACCCCATGGACCGCGTGCTCCTCC 
TCGTGTGGGGGGGGCATCCAGAGCCGGGCAGTTTCCTGTGTGGAGGAGGACATCCAGGGGCA 
T G T C AC T T CAG T GGAAGAG T G GAAATG CATG T AC ACCCC TAAGAT GC C CAT C GCGC AGCCC T 

GCAACATTTTTGACTGCCCTAAATGGCTGGCACAGGAGTGGTCTCCGTGCACAGTGACATGT 

GGCCAGGGCCTCAGATACCGTGTGGTCCTCTGCATCGACCATCGAGGAATGCACACAGGAGG 

CTGTAGCCCAAAAACAAAGCCCCACATAAAAGAGGAATGCATCGTACCCACTCCCTGCTATA 

AACCCAAAGAGAAACTTCCAGTCGAGGCCAAGTTGCCATGGTTCAAACAAGCTCAAGAGCTA 

GAAGAAGGAGCTGCTGTGTCAGAGGAGCCCTCGTfi^GTTGTAAAAGCACAGACTGTTCTATA 

TTTGAAACTGTTTTGTTTAAAGAAAGCAGTGTCTCACTGGTTGTAGCTTTCATGGGTTCTGA 

ACTAAGTGTAATCATCTCACCAAAGCTTTTTGGCTCTCAAATTAAAGATTGATTAGTTTCAA 
AAAAAAAAA 
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F I GURE 1 20 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA58847 
<subunit 1 of 1, 525 aa, 1 stop 
<MW: 58416, pi: 6.62, NX(S/T): 1 

MECCRRATPGTLLLFLAFLLLSSRTARSEEDRDGLWDAWGPWSECSRTCGGGASY5LRRCLS 
SKSCEGRNIRYRTCSNVDCPPEAGDFRAQQCSAHNDVKHHGQFYEWLPVSNDPDNPCSLKCQ 
AKGTTLWELAPKVLDGTRCYTESLDMCISGLCQIVGCDHQLGSTVKEDNCGVCNGDGSTCR 
LVRGQYKSQLSATKSDDTWALPYGSRHIRLVLKGPDHLYLETKTLQGTKGENSLSSTGTFL 
VDNSSVDFQKFPDKEILRMAGPLTADFIVKIRNSGSADSTVQFIFYQPIIHRWRETDFFPCS 
ATCGGGYQLTSAECYDLRSNRWADQYCHYYPENIKPKPKLQECNLDPCPASDGYKQIMPYD 
LYHPLPRWEATPWTACSSSCGGGIQSRAVSCV^EDIQGHVTSTOEWKCMYTPKMPIAQPCNI 
FDCPKWLAQEWSPCTVTCGQGLRYRWLCIDHRGMHTGGCSPKTKPHIKEECIVPTPCYKPK 
EKL P VE AKL P W FKQAQE LEE GAAVS EE P S 

Important features : 
Signal peptide: 

amino acids 1-25 

N-glycosylation site. 

amino acids 251-254 

Thrombospondin 1 

amino acids 385-399 

von Willebrand factor type C domain proteins 
amino acids 385-399, 445-459 and 42-56 
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FIGURE 12 1 

CGGACGCGTGGGCGGCGGCTGCGGAACTCCCGTGGAGGGGCCGGTGGGCCCTCGGGCCTGAC 

AG^EGGCAGTGGCCACTGCGGCGGCAGTACTGGCCGCTCTGGGCGGGGCGCTGTGGCTGGCG 

GCCCGCCGGTTCGTGGGGCCCAGGGTCCAGCGGCTGCGCAGAGGCGGGGACCCCGGCCTCAT 

GCACGGGAAGACTGTGCTGATCACCGGGGCGAACAGCGGCCTGGGCCGCGCCACGGCCGCCG 

AGCTACTGCGCCTGGGAGCGCGGGTGATCATGGGCTGCCGGGACCGCGCGCGCGCCGAGGAG 

GCGGCGGGTCAGCTCCGCCGCGAGCTCCGCCAGGCCGCGGAGTGCGGCCCAGAGCCTGGCGT 

CAGCGGGGTGGGCGAGCTCATAGTCCGGGAGCTGGACCTCGCCTCGCTGCGCTCGGTGCGCG 

CCTTCTGCCAGGAAATGCTCCAGGAAGAGCCTAGGCTGGATGTCTTGATCAATAACGCAGGG 

ATCTTCCAGTGCCCTTACATGAAGACTGAAGATGGGTTTGAGATGCAGTTCGGAGTGAACCA 

TCTGGGGCACTTTCTACTCACCAATCTTCTCCTTGGACTCCTCAAAAGTTCAGCTCCCAGCA 

GGATTGTGGTAGTTTCTTCCAAACTTTATAAATACGGAGACATCAATTTTGATGACTTGAAC 

AGTGAACAAAGCTATAATAAAAGCTTTTGTTATAGCCGGAGCAAACTGGCTAACATTCTTTT 

TACCAGGGAACTAGCCCGCCGCTTAGAAGGCACAAATGTCACCGTCAATGTGTTGCATCCTG 
G TAT T G T AC G G AC AAAT C T GGGGAG G C ACAT AC AC AT T CCAC T G T T GGT C AAACC AC T C TTC 

AATTTGGTGTCATGGGCTTTTTTCAAAACTCCAGTAGAAGGTGCCCAGACTTCCATTTATTT 
GGCCTCTTCACCTGAGGTAGAAGGAGTGTCAGGAAGATACTTTGGGGATTGTAAAGAGGAAG 
AACTGTTGCCCAAAGCTATGGATGAATCTGTTGCAAGAAAACTCTGGGATATCAGTGAAGTG 
ATGGTTGGCCTGCTAAAA^^GAACAAGGAGTAAAAGAGCTGTTTATAAAACTGCATATCAG 

TTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACTTGAAGAAAAAGAATTTTG 
AT AT T G G AAT AG C C T G C T AAGAGG TAC AT G T G G G TAT T T T GGAG T T AC T GAAAAAT TAT TT T 
T G GG AT AAGAG AAT T T C AG CAAAGAT G T T T TAAATAT ATATAG T AAG TAT AAT GAAT AAT AA 

GTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCAAGCATGGATGACATATTA 
ATAT T T G T C AGAAT T AAG T GAC T CAAAG T G CT AT CGAGAGG TT T T T CAAG TAT C T T T GAG T T 

TCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTTTGTGTGGAAATTATCTGC 
CTGGTGTGTGCACACAAGTCTTACTTGGAATAAATTTACTGGTAC 



A £3/£3f 
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F IGURE 122 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA58747 
<subunit 1 of 1, 336 aa, 1 stop 
<MW: 36865, pi: 9.15, NX(S/T): 2 

l^VATAAAVLAALGGALWLAARRFVGPRVQRLRRGGDPGLMHGKTVL I TGANS GLGRATAAE 

LLRLGARVIMGCRDRARAEEAAGQLRRELRQAAECGPEPGVSGVGELIVRELDIASLRSTOA 

FCQEMLQEEPRLDVLINNAGIFQCPYMKTEDGFEMQFGVNHLGHFLLTNLLLGLLKSSAPSR 

IVWSSKLYKYGDINFDDLNSEQSYNKSFCYSRSKLANILFTRELARRLEGTWTVNVLH 

IVRTNLGRHIHIPLLVKPLFNLVSWAFFKTPVEGAQTSIYLASSPEVEGVSGRYFGDCKEEE 

LLPKAMDE5VARKLWDI SEVMVGLLK 

Important features: 
Signal peptide: 

amino acids 1-21 

Short-chain alcohol dehydrogenase family protein 

amino acids 134-144, 44-56 and 239-248 

N-glycosylation site. 

amino acids 212-215 and 239-242 
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FIGUR E 1 23 

GGGGATTGTAAAGAGGAAGNACTGTGCCCAAAGNTATGGATGAATCTGTTGCAAGAAAATTN 
T GGGAT AT C AG T GAAG T GAT GGTTNGCCTGC T AAAATAG GAACAAGGAG T AAAAG AG C T GT T 
TATAAAAC TG C AT AT CAG T T ATATCTGTGAT CAGGAAT GG TGTGGATTGAGAAC T T G TTAC T 
T GAAGAAAAAG AAT T T T GAT AT T GGAAT AG C G T GNT AAG AG GNAC ATG T G GG TAT T T TGGAG 
T TAC T G AAAAAT TAT T T T T G GGATAAGAGAAT T T C AGCAAAGAT G T T T TAAAT AT ATATAG T 
AAGTATAATGAATAATAAGTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCA 
AGCATGGATGACATATTAATATTTGTCAGAATTAAGTGACTCAAAGTGCTATCGAGAGGTTT 
TTCAAGTATCTTTGAGTTTCATGGCCAAAGTGTT7UVCTAGTTTTACTACAATGTTTGGTGTT 
TGTGTGGAAATTATCTGCCTGGCTT 
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FIGURE 124 



GAGAGGACGAGGTGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCC 
CTTTCCTAACCCAACCCAACCTAGCCCAGTCCCAGCCGCCAGCGCCTGTCCCTGTCACGGAC 
CCCAGCGTTACCASSCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCT 
GCTCCTGGT7VACTTGGGTTTTTACTCCTGTAACAACTGAAATAACAAGTCTTGCTACAGAGA 
ATATAGATGAAATTTTAAACAATGCTGATGTTGCTTTAGTAAATTTTTATGCTGACTGGTGT 
CGTTTCAGTCAGATGTTGCATCCAATTTTTGAGGAAGCTTCCGATGTCATTAAGGAAGAATT 
TC C AAAT GAAAAT C AAG TAG T G T T T G CC AGAG TT GAT T G TG AT C AG C AC T C T GAC AT AGCC C 
AG AGAT AC AG G AT AAG C AAAT AC C C AAC C C T C AAAT T G T T T C G T AAT G G GAT GAT GAT GAAG 
AG AGAAT AC AG GG G T C AG C GAT C AG T GAAAGC AT T GGC AGAT T AC AT C AGGC AACAAAAAAG 
TGACCCCATTCAAGAAATTCGGGACTTAGCAGAAATCACCACTCTTGATCGCAGCAAAAGAA 
ATATCATTGGATATTTTGAGCAAAAGGACTCGGACAACTATAGAGTTTTTGAACGAGTAGCG 
AATATTTTGCATGATGACTGTGCCTTTCTTTCTGCATTTGGGGATGTTTCAAAACCGGAAAG 
AT AT AG T G G C GAC AAC AT AAT C T AC AAAC C AC C AG GG C AT TCTGCTCCG GAT AT GG T G TAC T 
TGGGAGCTATGACAAATTTTGATGTGACTTACAATTGGATTCAAGATAAATGTGTTCCTCTT 
G T C C GAG AAAT AAC AT T T GAAAAT GGAGAGGAAT T G AC AGAAGAAGGAC TGCCTTTT C TCAT 
ACTCTTTCACATGAAAGAAGATACAGAAAGTTTAGAAATATTCCAGAATGAAGTAGCTCGGC 
AATTAATAAGTGAAAAAGGTACAATAAACTTTTTACATGCCGATTGTGACAAATTTAGACAT 
CCTCTTCTGCACATACAGAAAACTCCAGCAGATTGTCCTGTAATCGCTATTGACAGCTTTAG 
GCATATGTATGTGTTTGGAGACTTCAAAGATGTATTAATTCCTGGAAAACTCAAGCAATTCG 
TAT T T GAC T TAC AT T C T G G AAAAC T G C AC AGAGAAT.T CC AT CAT G GAC C T GAC CC AAC T GAT 
ACAGC C CCAGGAGAGC AAGC CCAAGATGTAGCAAGCAGT CCACC TGAGAGCTCCT TCCAGAA 
AC TAG CAC C C AG T G AAT AT AGGT AT AC TC TAT TGAG GGAT C GAGAT GAG C T T^^AAAC T T G 
AAAAAC AG T T T G T AAG C C T T T CAACAGCAGCATCAACC TACG TG G T G G AAAT AGT AAACC TA 
TAT T T T CATAAT TC TATG T GTAT TTTTAT T T TGAATAAACAGAAAGAAATTTAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 12B 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss . DNA57689 
<subunit 1 of l, 406 aa, 1 stop 
<MW: 46927, pi: 5.21, NX (S'/T) : 0 

MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLATENIDEILNNADVALVNFyADWCRFSQ 
MLHPI FEEASDVIKEEFPNENQWFARVDCDQHSDIAQRYRI S K Y P T LKL FRN GMMMKRE YR 
GQRS VKALAD Y I RQQKS DP I QE IRDLAE I TTLDRSKRNI I G Y FE QKDS DN YRVFERVAN I LH 
DDCAFLSAFGDVSKPERYSGDNIIYKPPGHSAPDMVYLGAMTNFDVTYNWIQDKCVPLVREI 
TFENGEELTEEGLPFLILFHMKEDTESLEIFQNEVARQLISEKGTINFLHADCDKFRHPLLH 

IQKTPADCPVIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDLHSGKLHREFHHGPDPTDTAPG 
EQAQDVASSPPESSFQKLAPSEYRYTLLRDRDEL 

Important features : 
Signal peptide: 

amino acids 1-29 

Endoplasmic reticulum targeting sequence. 

amino acids 403-406 

Tyrosine kinase phosphorylation site. 

amino acids 203-211 

Thioredoxin family proteins 
amino acids 50-66 
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FIGURE 126 

ATTAAGGAAGAATTTCCAAATGAAAATCAAGTAGTNTTTGCCAGAGTNGATTGTGATCAGCA 
CTCTGACAT AG C C C AG AG ATACAGGATAAGCAAATACCCAACC CTCAAATTGTTTCG TAATG 
GGATGATGATGAAGAGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTA 
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AGAGGCCTCTCTGGAAGTTGTCCCGGGTGTTCGCCGCNGGAGCCCGGGTCGAGAGGACNAGG 
TGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCCCTTTCCTAACCC 
AACCCAACCTAGCCCNGTCCCAGCCGCCAGCGCCTGTCCCTGTCNCGGJ\NCCCAGCGTNACC 
ATGCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCTGCTCCTGGTAAC 
TTGGGTTTT TAG T C C T G T AAC AAC T GAAATAACNNG T C T T GAT ACNNAGAATAT AGAT G AAA 

TTTTAAACNATGCTGATGTGGCTTTAGTCAATTTTTATGCTGACTGGTGTCGTTTCAGTCAG 
ATGTGGCATCCAATTTTTGAGGANGCTTCCGATGTCATTAAGGAAGAATTTCCAAATGAAAA 
T C AAG TAG T G T T T G C C AG AG T T GAT T G T GAT CAGCAC T C T GAC AT AGC C CAGAGAT AC AGGA 
T AAG CAAAT AC C C AAC C C T CAAAT TG T T TC G TAAT GGG AT GAT GAT GAAG AGAGAAT AC AGG 
GG T CAGC GAT C AG T GAAAG CAT T G G CAGAT TACAT C AGGC 
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FIGURE 128 



GCCCACGCGTCCG^ESGCGTTCACGTTCGCGGCCTTCTGCTACATGCTGGCGCTGCTGCTCA 
CTGCCGCGCTCATCTTCTTCGCCATTTGGCACATTATAGCATTTGATGAGCTGAAGACTGAT 
TAC AAG AAT C C T AT AGAC C AG TG TAATACC C T GAAT C C CC T TG TAC T C C CAGAG T AC C T CAT 
CCACGCTTTCTTCTGTGTCATGTTTCTTTGTGCAGCAGAGTGGCTTACACTGGGTCTCAATA 
TGCCCCTCTTGGCATATCATATTTGGAGGTATATGAGTAGACCAGTGATGAGTGGCCCAGGA 
CTCTATGACCCTACAACCATCATGAATGCAGATATTCTAGCATATTGTCAGAAGGAAGGATG 
GTGCAAATTAGCTTTTTATCTTCTAGCATTTTTTTACTACCTATATGGCATGATCTATGTTT 
T G G T GAG C T C T JE&gAAC AAC ACACAGAAGAAT T G G T C CAG T TAAG T G CAT GCAAAAAG CC AC 
CAAATGAAGGGAT T C TAT C CAGCAAGATCCTG TC CAAGAG TAGCCTG TGGAATCT GATCAG T 
TACTTTAAAAAATGACTCCTTATTTTTTAAATGTTTCCACATTTTTGCTTGTGGAAAGACTG 
TTTT CAT AT G T TAT AC T C AG AT AAAGAT T T TAAAT GGTAT TAC G TATAAAT T AAT AT AAAAT 
GATTACCTCTGGTGTTGACAGGTTTGAACTTGCACTTCTTAAGGAACAGCCATAATCCTCTG 
AATGATGCATTAATTACTGACTGTCCTAGTACATTGGAAGCTTTTGTTTATAGGAACTTGTA 
GGGCTCATTTTGGTTTCATTGAAACAGTATCTAATTATAAATTAGCTGTAGATATCAGGTGC 
TTCTGATGAAGTGAAAATGTATATCTGACTAGTGGGAAACTTCATGGGTTTCCTCATCTGTC 
ATGTCGATGATTATATATGGATACATTTACAAAAATAAAAAGCGGGAATTTTCCCTTCGCTT 
GAAT AT TATCCCTG TAT AT T GC AT GAATG AGAGAT T T CCCAT AT T T CCAT CAGAG TAATAAA 
T ATAC T T GC T T T AAT T C T T AAGCATAAG TAAAC AT GAT ATAAAAATATAT GC T GAAT TAC T T 
GTGAAGAATG CAT TTAAA.GC TAT TTTAAATGTGTTTTTATTTGTAAGACATTACTTAT TAAG 
AAATTGGTTATTATGCTTACTGTTCTAATCTGGTGGTAAAGGTATTCTTAAGAATTTGCAGG 
TACTACAGATTTTCAAAACTGAATGAGAGAAAATTGTATAACCATCCTGCTGTTCCTTTAGT 

GC AAT AC AAT AAAAC T C T GAAAT TAAG AC T C 
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FIGURE 129 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA23330 
<subunit 1 of 1, 144 aa, 1 stop 
<MW: 16699, pi: 5.60, NX(S/T): 0 

MAFT FAAFC YMLALLL T AAL I FFAIWH 1 I AFDELKTDYKNP I DQCNTLNPLVLPE YL IHAFF 

CVMFLCAAEWLTLGLNMPLLAYHIW^ 

FYLLAFFYYL YGMI YVLVS S 

Important features: 
Signal peptide : 

amino acids 1-20 

Type II transmembrane domain: 

amino acids 11-31 

Other transmembrane domain: 
amino acids 57-77 and 123-143 
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FIGURE 130 



AT TAT AG CAT T T GAT GAG C T GAAGAC T GAT T ACAAGATC C T ATAGACC AG T G T AAT AC C C T G 
AATCCCCTTGTACTCCCAGAGTACCTCATCCACGCTTTCTTCTGTGTCATGTTTCTTTGTGC 
AGCAGAGTGGCTTACACTGGGTCTCAATATGCCCCTCTTGGCATATCATATTTGGAGGTATA 
TGAGTAGACCAGTGATGAGTGGCCCAGGACTCTATGACCCTACAACCATCATGAATGCAGAT 
ATTCTAGCATATTGTCAGAAGGAAGGATGGTGCAAATTAGCTTTTTATCTTCTAGCATTTTT 
T TAC TAC C TATATGGCAT GATCTATGTT T TGG TGAGC TCTT AGAACAACAC ACAGAAGAAT T 
GG T C C AG T T AAG T G CAT G C AAAAAGC C AC C AAAT GAAGGGAT T C TAT C C AGC AAG AT C C T G T 
CCAAG AG TAG C C T G T GGAAT C T GAT CAG T TAC T T T AAAAAAT G 
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FIGURE 131 

CGGACGCGTGGGGGAAACCCTTCCGAGAAAACAGCAACAAGCTGAGCTGCTGTGACAGAGGG 
GAACAAGAISGCGGCGCCGAAGGGGAGCCTCTGGGTGAGGACCCAACTGGGGCTCCCGCCGC 
TGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCGCTTCGGCTGAAGCATTTGAC 
TCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACAC 
CTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTC 
AGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACA 
GAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCC 
ATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTC 
C T C T AAC T C T G G T G AGG T CAT T C T G GAG T G ACAT GAT GGAC T C C GCACAGAGC T T C ATAACC 

TCTTCATGGACTTTTTATCTTCAAGCCGATGACGGAAAAATAGTTATATTCCAGTCTAAGCC 
AGAAATCCAGTACGCACCACATTTGGAGCAGGAGCCTACAAATTTGAGAGAATCATCTCTAA 
GCAAAAT G T C C TAT C T GC AAATGAGAAAT T CACAAGCGCACAGGAAT T T T C TTGAAGATGGA 

GAAAGTGATGGCTTTTTAAGATGCCTCTCTCTTAACTCTGGGTGGATTTTAACTACAACTCT 
TGTCCTCTCGGTGATGGTATTGCTTTGGATTTGTTGTGCAACTGTTGCTACAGCTGTGGAGC 
AGTATGTTCCCTCTGAGAAGCTGAGTATCTATGGTGACTTGGAGTTTATGAATGAACAAAAG 
CTAAACAGATATCCAGCTTCTTCTCTTGTGGTTGTTAGATCTAAAACTGAAGATCATGAAGA 
AGCAGGGCCTCTACCTACAAAAGTGAATCTTGCTCATTCTGAAAT2TAAGCATTTTTCTTTT 
AAAAG AC AAG T G T AAT AG AC AT C T AAAAT T C C AC T CC T C AT AGAG C T T T T AAAAT GG T T T C A 
T T GGAT AT AG GCC T T AAGAAAT CAC TAT AAAAT G CAAAT AAAG T TAC T CAAAT CTGTG 



WO 99/46281 
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FIGURE 132 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss.DNA2 6847 
<subunit 1 of 1, 323 aa, 1 stop 
<MW: 36223, pi: 5.06, NX(S/T): 1 

MAA.PKGSLWVRTQLGLPPLLLLTMALAGGSGTA5AEAFDSVLGDTASCHRACQLTYPLHTYP 
KEEELYACQRGCRLFSICQFVDDGIDLNRTKLECESACTEAYSQSDEQYACHLGCQNQLPFA 
ELRQEQLMS LMPKMHLL FPLTLVRS FWSDMMDSAQS FITS SWT FYLQADDGKIVT FQSKPE I 
QYAPHLEQEPTNLRESSLSKMSYLQMRNSQAHRNFLEDGESDGFLRCLSLNSGWILTTTLVL 
S VMVL L W I C CAT VAT AVE Q Y VP S EKL S I YGDLE FMNE QKLNR YPAS S LWVRS KTE DHEE AG 
PLPTKVNLAHSEI 

Important features : 
Signal peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 241-260 

N-glycosylation site . 

amino acids 90-93 




WO 99/46281 

FIGURE 133 



PCT/US99/0S028 



TTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACACCTACCC 
TAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTCAGTTTG 
TGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACAGAAGCA 
TATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCCATTCGC 
T GAAC T GAG AC AAG AACAAC T TATG T CCCTGATGCCAA&AATGCACCTAC TCTTTCCTC TAA 
CTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGC 



A 25/ OR- 



WO 99/46281 PCT/US99/05028 

FIGURE 134 

CACACTGGCCGGATCTTTTAGAGTCCTTTGACCTTGACCAAGGGTCNGGAAAACAGCAACAA 
GCTGAGCTGCTGTGACAGAGGGAACAAGATGGCGGCGCCGAAGGGAGCCTTTGGGTGAGGAC 
CCAACTGGGGCTCCCGCCGCTGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCG 
CTTCGGCTGAAGCATTTGACTCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAG 
TTGACCTACCCCTTGCACACCTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTG 
CAGGCTGTTTTCAATTTGTCAGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGG 
AATGTGAATCTGCATGTACAGAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTT 
G G T T G C C AG AAT C AG C T G C CAT TC G C TGAAC T GAG AC AAG AACAAC TTAT G TC C C T GATGC C 
AAAAATGCACCTACTCTTTCCTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACT 
CCGC 
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FIGURE 135 



GCGAGGTGGCGATCGCTGAGAGGCAGGAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAGGT 

GGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCCGCGACCGAGC 

GTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCA&2SGGCCGCGGCTG 

GGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGC 

AGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGT 

ACCTGTG AT G TT GAAA C C AT TGATAGATTTAATAACTAC AGG CTTTTCC CAAGACTACAAAA 

ACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCGTGTCCTTTCT 

GGAATGACATCAGCCAGTGTGGAAGAAGGGACTGTGCTGTCAAACCATGTCAATCTGATGAA 

GTTCCTGATGGAATTAAATCTGCGAGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGA 

AGAATGTGAACAAGCTGAACGACTTGGAGCAGTGGATGAATCTCTGAGTGAGGAAACACAGA 

AGGCTGTT CTT C AG TGG AC C AAGCATGATGATTCTTCAGATAACTT CTG TGAAG CTGATGAC 

ATTCAGTCCCCTGAAGCTGAATATGTAGATTTGCTTCTTAATCCTGAGCGCTACACTGGTTA 

CAAGGG AC C AG ATGCTTGGAAAAT ATGGAATGT CAT CTACGAAGAAAACTGTTTTAAG CC AC 

AGACAATTAAAAGACCTTTAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGAACACT 

TTTTACAGTTGGCTAGAAGGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGG 

CCTACATGCAAGCATTAATGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAG 

AAAAGAAATGGGGACACAACATTACAGAATTTCAACAGCGATTTGATGGAATTTTGACTGAA 

GGAG AAGGT C C AAGAAGG CTTAAGAACTTGTATTTT CT CT ACTTAAT AGAACTAAGGG CTTT 

ATCCAAAGTGTTACCATTCTTCGAGCGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTC 

AGGATGAGGAAAACAAAATGTTACTTCTGGAAATACTTCATGAAATCAAGTCATTTCCTTTG 

CATTTTG AT G AGAATTC ATTTTTTG CTGGGGATAAAAAAGAAGCACACAAACTAAAGGAGGA 

CTTTCGACTGCATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTC 

GTCTGTGGGGAAAGCTTCAGACTCAGGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAG 

AAATTG ATAGCAAAT ATG C C AGAAAGTGGACCTAGTTATGAATTCCATCTAACCAG ACAAGA 

AATAGT AT CATTATT C AACGCATTTGGAAGAATTTCT ACAAGTGTGAAAGAATTAGAAAACT 

TCAGGAACTTGTTACAGAATATTCATI^&AGAAAACAAGCTGATATGTGCCTGTTTCTGGAC 

AATGGAGG CGAAAGAG TGGAATTTCATTCAAAGG CATAATAGCAATGACAGTCTTAAG CCAA 

ACATTTTATATAAAGTTGCTTTTGTAAAGGAGAATTATATTGTTTTAAGTAAACACATTTTT 

AAAAATTG TGTTAAG T CT ATGTATAATACTACTGTGAGTAAAAGTAATACTTTAATAATGTG 

GTACAAATTTTAAAGTTTAATATTGAATAAAAGGAGGATTATC^^ 

AAAAAAAAAAAAAAAAAAAAAAAAA 




WO 99/46231 PCT7US99/O5028 

FIGURE 138 

</usr/seqdfc2/sst/DNA/Dnaseqs .mill/ ss . DNA5 397 4 
<subunit 1 of 1, 4 63 aa, 1 stop 
<MW: 54393, pi: 5.63, NX(S/T): 2 

MGRGWGFLFGLLGAWLLSSGHGEEQPPETAA.QRCFCQVSGYLDBCTCDVETIDRFNNYRLF 
PRLQKLLESrTFRYYKVNLKRPCPFWNDISQCGRRDCAVKPCQSDEVPDGIKSASYKYSEEA 
NNL IEECEQAERLGAVDES LSEETQKAVLQWTKHDDSSDNFCEADD I QS PEAEYVDLLLNPE 

RYTGYKGPDAWKIWWIYEENCFKPQTIKRPLNPLASGQGTSEENTFYSWLEGLCVEKRAFY 
RL I SGLHAS INVHLSARYLLQETWLEKKWGHNI TEFQQRFDGILTEGEGPRRLKNLYFLYLI 
ELRALS KVLP FFERPDFQL FTGNKI QDEENKMLLLEILHE IKS FPLHFDENSFFAGDKKEAH 

KLKEDFRLHFRNISRIMDCVGCFKCRLWGKLQTQGLGTALKILFSEKLIANMPESGPSYEFH 
LTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH 

Important features : 
Signal peptide: 

amino acids 1-23 

N-glycosylation site, 

amino acids 280-283 and 384-387 

Amidation site, 

amino acids 94-97 

Glycosaininoglycan attachment site. 

amino acids 20-23 and 223-226 

Aminotransferases class-V pyridoxal -phosphate 
amino acids 216-222 

Interleukin-7 proteins 

amino acids 338-343 



WO 99/46281 PCT/US99/0502S 

FI GURE 132 

GCTGGAAATATGGATGTCATCTACGAGi^.CTGTTTTAAGCCACAGACA^TTAAAAGACCTT 
TAAAT C C T T T GGC T T C T GG T CAAGGGACAAGTGAAGAGNACACTTTTTACAGT T GGCTAGA A 
GGTCTCTGTG T AGAAAAAAG AGCAT T CTAC AGAX T T ATAT C T GG C C TACAT GC AAGCAT TAA 

TGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTA.GAAAAGAAATGGGGA.CACA 
AC AT T AC AGAAT T T NAJVC AGC GAT T T GAT GGAAT T T T GAC T GAAG G AGAAGG T C C AAGAAGG 
C T TAAGAAC T T G TAT T T T C T CT AC T TAATAGAAC T AAGG G C T T TAT C CAAAG T G T TAG CAT T 
C T TNG AG CGCC C AG A T T T T C AAC TNT T TAC T GGAAAT AAAAT T C AGGAT GAGGNAAAC AAAA 

TGTTACTTTTGGAAATACTTCATGAAATCAAGTCATTTCCTTTGCATTTTGATGAGAATTCA 
TTTTTTTGCTG 



WO 99/46281 PCT/US99/C5028 

FIGURE 1 3 8 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGTTGGGAGGGGGCAGGATGGGAGGGAA 
AG T G AAG AAAACAGAAAAGG AGAGGGAC AGAGG C CAGAGG AC T T C T CAT AC T GGAC AGAAAC 

CGATCAGGCA1SGAACTCCCCTTCGTCACTCACCTGTTCTTGCCCCTGGTGTTCCTGACAGG 
TCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCTATTCCCAGGGCCACCAGAAG 
CTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGACAGCGATGGATGCTGGTGGGC 
GCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTTTATCGCTGCCCTGTAGGGGG 
GGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTACCAACTGGGAAATTCATCTC 
ATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGACAGATGGTGATGGGGGATTC 
AT G G T GAG C TAAG GAGAGGG T GGT GGCAG T G T C TC T GAAGG T C CATAAAAGAAAAAAGAGAA 
G T G T G G T AAG G GAAAAT G G T C T GT GT GG AGGGGT CAAGGAG T TAAAAAC C C TAGAAAGC AAA 
AGGTAGGTAATGTCAGGGAGTAGTCTTCATGCCTCCTTCAACTGGGAGCATGTTCTGAGGGT 
GCCCTCCCAAGCCTGGGAGTAACTATTTCCCCCATCCCCAGGCCTGTGCCCCTCTCTGGTCT 
CGTGCTTGTGGCAGCTCTGTCTTCAGTTCTGGGATATGTGCCCGTGTGGATGCTTCATTCCA 
GCCTCAGGGAAGCCTGGCACCCACTGCCCAACGTGAGCCAGAGGAAGGCTGAGTACTTGGTT 
CCCAGAAGGAGATACTGGGTGGGAAAAAGATGGGGCAAAGCGGTATGATGCCTGGCAAAGGG 
CCTGCATGGCTATCCTCATTGCTACCTAATGTGCTTGCAAAAGCTCCATGTTTCCTAACAGA 
T T C AGAC T CCT GG C CAGG TGTGGTGGCC C ACAC CT GTAAT TC TAGCAC T T T GG GAGGC CAAG 

GTGGGCAGATCACTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACTCCAT 
CTCTACTAAAAAAAAAAAAATACAAAAATTAGCTGGGTGCGCTAGTGCATGCCTGTAATCTC 
AT C T AC T C GGG AGG C T AAGACAGGAGAC T C T CACT T CAACC CAGGAG G T GGAG GT T GC G GT G 

AGCCAAGATTGTGCCTCTGCACTCTAGCGTGGGTGACAGAGTAAGCGAGACTCCATCTCAAA 
AATAATAATAATAATAATTCAGACTCCTTATCAGGAGTCCATGATCTGGCCTGGCACAGTAA 
CTCATGCCTGTAATCCCAACATTTTGGGAGGCCAACGCAGGAGGATTGCTTGAGGTCTGGAG 
GTTTGAGACCAGCCTGGGCAACATAGAAAGACCCCATCTCTAAATAAATGTTTTAAAAAT 



WO 99/46231 
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FIGURE 139 



x/usr/seqdb2/55t/DNA/Dnaseqs-min/ss .DNA57039 
xsubunit 1 of 1, 124 aa, I stop 
XMW: 13352, pi: 5.99, NX(S/T): 1 

MELPFVTHLFLPLVFLTGLCSPEmDEHHPRLFPGPPEAEFGYSVLQHVGGGQRWMLVGAPW 
DGPSGDRRGDWKCPVGGAHNAPCAKGHLGDYQLGNSSKPAVIMELGMSLLETDGDGGFKV'S 

Important features: 
Signal peptide: 

amino acids 1-22 

Cell attachment sequence, 

amino acids 70-73 

N-glycosylation site. 

amino acids 98-101 

Integrins alpha chain proteins 
amino acids 67-81 




WO 99/46281 PCT/U599/05ri2S 

FIGURE 140 

CACAGTTCCCCACCATCACTCNTCCCATTCCTTCCAACTTTATTTTTAGCTTGCCATTGGGA 

GGGGGCAGGATGGGAGGGAAAGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGAC 

TTCTCATACTGGACAGAAACCGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTG 

CCCCTGGTGTTCCTGACAGGTCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCT 

ATTCCCAGGGCCACCAGAAGCTGAA.TTTGGATACAGTGTCTTACAACATGTTGGGGGTGGAC 

AGCGATGGATGCTGGTGGGCGCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTT 

TATCGCTGCCCTGTAGGGGGGGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTA 

CCAACTGGGAAATTCATCTCATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGA 
CAGATGGTGATGG 
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FIGURE 141 



AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 

GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 

ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 

GAGATGGACAGAA.TGCTTTATTTTGGAAAGAAACAA.TGTTCTAGGTCAAACTGAGTCTACCA 

AATSCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 

ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 
T C T G T AC T C T C AAC C AAC AT GAAGC AT C T C TT GAT GT GGAGC C C AG T GAT C GC GC C T GGAGA 

AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 
GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 
ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 
GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 
TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 
C T T G T GGC C T AC T G GAG GAGGGAGC C T G G T GCC GAGGAAC AT GT CAAAAT GGT GAGGAGTGG 
GGG TAT T C CAGT G C AC C T AGAAAC CAT GGAGC C AG GGGC T GC AT AC T GT G T GAAGGC C CAGA 

CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 
GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 

GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 
TGGTCCTCC C AGAC AC C T T GAAAATAAC CAAT T CAC C C C AGAAG T T AAT C AGC T GCAGAAGG 

GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 
C T C ATAgG T T T G C GGAAG G G C C C AGG T GAAGC C GAGAAC CT GG T C T G CAT G ACAT GGAAAC C 
AT GAG G GGAC AAG TTGTGTTTCTGTTTTCCGC CAC GGACAAGGGAT G AGAGAAG T AGGAAGA 
GC C TGT TGT CTACAAG T C TAGAAGCAAC CATCAGAGGCAGGGTGGT TTGTCTAACAGAACAC 
TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 
CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 
TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 

TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 
TGTTTCT GGAGAG C AGG AC ATAAAT G TAT GAT GAGAAT GATCAAGGAC TC TACACACT GGGT 
GG C T T GGAG AG C C CAC T T T C C C AGAATAAT C C T T GAGAGAAAAGGAAT CAT GGGAGC AAT GG 
TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 
AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 
CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 

GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 
T GT T G G T AAAG T AC AGAAT T C AGCAAAT AAAAAGGGC CAC C C T G GC CAAAAGC GGTAAAAAA 
AAAAAAAAAA 
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FIGUR E 142 



</usf /seqdb2/sst/DNA/Dnaseq5,min/ss.DNA57033 
<subunit 1 of 1, 311- aa< i stop 
<MW: 35076, pi: 5.04, NX(S/T); 2 

MQT FTMVLEE I WTS L FMWFFYALI PCLLTDEVAILPAPQNLS VLS TNMXHLLMWS FVTAPGE 

TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITAT T 7PYNLRVRATLGSQTSAW 
S I LKHP FNRNS TIL TRPGME I TKDGFHLVI ELEDLGPQ FE FLVAYWRREPGAEEEVKMVRSG 
GIPVHLETMEPGAAYCVKAQTFVTSAIGRYSAFSQTEC^ 

WPLFVWKMGRLLQYS CCPVWL P DTLKI TNSPQKL I S CRREE VDACATAVMS PEELLRAWI S 

Important features: 
Signal peptide: 
amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation site. 

amino acids 40-43 and 134-137 

Tissue factor proteins. 

amino acids 92-119 

Integrins alpha chain proteins 

amino acids 232-2 62 
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FIGURE 143 

TCCTGCTGATGCACATCTGGGTTTGGC.AAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 
TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATT7TGGAAAGAAACAATGTTCTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTC7 
TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCC 
ACATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT 
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FIGU RE 144 

CCCACGCGTCCGCCCACGCGTCCGAGGGACAAGAGAGAAGAGAGACTGAAACAGGGAGAAGA 

GGCAGGAGAGGAGGAGGTGGGGAGAGCACGAAGCTGGAGGCCGACACTGAGGGAGGGCGGGA 

GGAGGTGAAGAAGGAGAGAGGGGAGAAGAGGCAGGAGCTGGAAAGGAGAGAGGGAGGAGGAG 

GAGGAGATGCGGGATGGAGACCTGGAGTTAGGTGGCTTGGGAGAGCTTAATGAAAAGAGAAC 

GGAGAGGAGGTGTGGGTTAGGAACCAAGAGGTAGCCCTGTGGGCAGCAGAAGGCTGAGAGGA 

GTAGGAAGATCAGGAGCTAGAGGGAGACTGGAGGGTTCCGGGAAAAGAGCAGAGGAAAGAGG 

AAAGACACAGAGAGACGGGAGAGAGAAGAAGAGTGGGTTTGAAGGGCGGATCTCAGTCCCTG 

GCTGCTTTGGCATTTGGGGAACTGGGACTCCCTGTGGGGAGGAGAGGAAAGCTGGAAGTCCT 

GGAGGGACAGGGTCCCAGAAGGAGGGGACAGAGGAGCTGAGAGAGGGGGGCAGGGCGTTGGG 

CAGGGGTCCCTCGGAGGCCTCCTGGGGMGGGGGCTGCAGCTCGTCTGAGCGCCCC7CGAGC 

GCTGGTACTCTGGGCTGCACTGGGGGCAGCAGCTCACATCGGACCAGCACCTGACCCCGAGG 

ACTGGTGGAGCTACAAGGATAATCTCCAGGGAAACTTCGTGCCAGGGCCTCCTTTCTGGGGC 

CTGGTGAATGCAGCGTGGAGTCTGTGTGCTGTGGGGAAGCGGCAGAGCCCCGTGGATGTGGA 

GCTGAAGAGGGTTCTTTATGACCCCTTTCTGCCCCCATTAAGGCTCAGCACTGGAGGAGAGA 

AGCTCCGGGGAACCTTGTACAACACCGGCCGACATGTCTCCTTCCTGCCTGCACCCCGACCT 

GTGGTCAATGTGTCTGGAGGTGCCCTCCTTTACAGCCACCGACTCAGTGAACTGCGGCTGCT 

GTTTGGAGCTCGCGACGGAGCCGGCTCGGAACATCAGATCAACCACCAGGGCTTCTCTGCTG 

AGGTGCAGCTCATTCACTTCAACCAGGAACTCTACGGGAATTTCAGCGCTGCCTCCCGCGGC 

CCCAATGGCCTGGCCATTCTCAGCCTCTTTGTCAACGTTGCCAGTACCTCTAACCCATTCCT 

CAGTCGCCTCCTTAACCGCGACACCATCACTCGCATCTCCTACAAGAATGATGCCTACTTTC 

TTCAAGACCTGAGCCTGGAGCTCCTGTTCCCTGAATCCTTCGGCTTCATCACCTATCAGGGC 

TCTCTCAGCACCCCGCCCTGCTCCGAGACTGTCACCTGGATCCTCATTGACCGGGCCCTCAA 

TATCACCTCCCTTCAGATGCACTCCCTGAGACTCCTGAGCCAGAATCCTCCATCTCAGATCT 

TCCAGAGCCTCAGCGGTAACAGCCGGCCCCTGCAGCCCTTGGCCCACAGGGCACTGAGGGGC 

AACAGGGACCCCCGGCACCCCGAGAGGCGCTGCCGAGGCCCCAACTACCGCCTGCATGTGGA 

TGGTGTCCCCCATGGTCGCTSAGACTCCCCTTCGAGGATTGCACCCGCCCGTCCTAAGCCTC 

CCCACAAGGCGAGGGGAGTTACCCCTAAAACAAAGCTATTAAAGGGACAGAATACTTA 



WO 99/4628! PCT/US99/05028 

FIGURE 145 

</usr/seqdb2/sst/DNA/Dnaseqs «min/ss . DNA34353 
<subunit I of 1, 328 aa, 1 stop 
<MW: 36238, pi: 9.90, NX(S/T): 3 
MGAAARLSAPRALVLWAALGAAAKIGPAPDPEDWWSYK^ 

AVGKRQSPVDVELKRVLYDPFLPPLRLSTGGEKLRGTLYNTGRHVS FLPAPRPWNVSGGPL 
LYSHRLSSLRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGLAILSL 
FVNVASTSNPFLSRLLNRDTITRISYKNDAYFLQDLSLELLFPESFGFITYQGSLSTPPCSE 
TVTW I L I DRALN ITS LQMHS LRLLS QNPPSQI FQS LSGNSRP LQPLAHRALRGNRDPRHPER 
RCRGPNYRLHVDGVPHGR 



Important features : 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 
amino acids 177-199 

N-glycosylation site. 

amino acids 118-121, 170-173 and 260-263 

EuJcaryotic-type carbonic anhydrases proteins 

amino acids 222-270, 128-164 and 45-92 
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FIGURE 146 

GGCGCCTGGTTCTGCGCGTACTGGCTGTACGGAGCAGGAGCAAGAGGTCGCCGCCAGCCTCC 
GCCGCCGAGCCTCGTTCGTGTCCCCGCCCCTCGCTCCTGCAGCTACTGCTCAGAAACGCTGG 
GGCGCCCACCCTGGCAGACTAACGAAGCAGCTCCCTTCCCACCCCAACTGCAGGTCTAATTT 
T GGAC GCTTTGCCTGC CAT T T CT T C C AGGT T GAGGGAGCC GC AGAGGC G GAGGC T CGC GT AT 
TCCTGCAGTCAGCACCCACGTCGCCCCCGGACGCTCGGTGCTCAGGCCCTTCGCGAGCGGGG 
CTCTCCGTCTGCGGTCCCTTGTGAAGGCTCTGGGCGGCTGCAGAGGCCGGCCGTCCGGTTTG 
G C T C AC C T C T CC C AG GAAAC TT CACAC T GGAGAGC C AAAAGGAG T GGAAGAGC CTGTCTTGG 

AGATTTTCCTGGGGAAATCCTGAGGTCATTCATTAISAAGTGTACCGCGCGGGAGTGGCTCA 
GAGTAACCACAGTGCTGTTCATGGCTAGAGCAATTCCAGCCATGGTGGTTCCCAATGCCACT 
T TAT T GGAGAAAC TT T T GGAAAAATACATGGATGAGGATGGTGAGTGGT GGATAGC CAAACA 
AC G AGGGAAAAG GGC CAT C AC AG ACAAT G AC AT GCAGAGT AT T T T GGAC C T TC AT AATAAAT 
TACGAAGTCAGGTGTATCCAACAGCCTCTAATATGGAGTATATGACATGGGATGTAGAGCTG 
GAAAGATCTGCAGAATCCTGGGCTGAAAGTTGCTTGTGGGAACATGGACCTGCAAGCTTGCT 
T C CAT C AAT T GG AC AGAAT T T GGGAGCACAC T GGGGAAGATATAGG C CC C C G AC GT T T CAT G 
T ACAAT C G T G G TAT GAT G AAGT GAAAGACT TTAGC TAC C CAT AT GAACAT GAAT G CAAC C CA 
TAT T GT C CAT T C AGG TGTTCTGGCCCTG TAT G TACAC AT TAT AC ACAGG T C G T G T GGGCJ^C 
TAG TAAC AGAAT CGGTTGTGC CAT T AAT T T GT G T CAT AACAT GAACAT C T GGGG GC AGATAT 
GGCCCAAAGCTGTCTACCTGGTGTGCAATTACTCCCCAAAGGGAAACTGGTGGGGCCATGCC 
CCTTACAAACATGGGCGGCCCTGTTCTGCTTGCCCACCTAGTTTTGGAGGGGGCTGTAGAGA 
AAATCTGTGCTACAAAGAAGGGTCAGACAGGTATTATCCCCCTCGAGAAGAGGAAACAAATG 
AAA T AG AAC GAC AGC AG T C ACAAGT C CAT GACAC C CATGT C C G GAC AAGAT CAGAT GATAG T 
AGCAGAAATGAAGTCATAAGCGCACAGCAAATGTCCCAAATTGTTTCTTGTGAAGTAAGATT 
AAGAGATCAGTGCAAAGGAACAACCTGCAATAGGTACGAATGTCCTGCTGGCTGTTTGGATA 
GT AAAG C TAAAGT TAT T G G C AGT GTACAT TAT GAAAT G CAAT C C AGC AT CT GT AGAGCT GCA 
ATT CAT TAT GGT AT AATAGACAATGAT GGTGGCTGGGTAGATAT CACTAGACAAGGAAGAAA 
GCATTATTTCATCAAGTCCAATAGAAA.TGGTATTCAAACAATTGGCAAATATCAGTCTGCTA 
AT T C C T T CAC AG T C T C T AAAG TAACAG T TCAGGC T GTGAC T T GT GAAAC AAC T GTGGAACAG 
CTCTGTCCATTTCATAAGCCTGCTTCACATTGCCCAAGAGTATACTGTCCTCGTAACTGTAT 
G CAAGC AAA! C CAC AT TAT G C T C G T GT AAT T GGAACT C GAGT T TAT T C T GAT C T G T CC AGT A 
TCTGCAGAGCAGCAGTACATGCTGGAGTGGTTCGAAATCACGGTGGTTATGTTGATGTAATG 
C C T G T G G AC AAAAGAAAGAC CT AC AT TGCTTCTTTT CAGAAT GGAAT C T T C T C AGAAAGTT T 

ACAGAATCCTCCAGGAGGAAAGGCATTCAGAGTGTTTGCTGTTGTG3SSAAACTGAATACTTG 
GAAGAGGACCATAAAGACTATTCCAAATGCAATATTTCTGAATTTTGTATAAAACTGTAACA 
T TAC T G T AC AGAG TAC AT C AAC TATTT T CAGCCCAAAAAGGT GC CAAAT GC AT ATAAAT CT T 
GATAAACAAAGTCTATAAAATAAAACATGGGACATTAGCTTTGGGAAAAGTAATGAAAATAT 
AAT GGT T T T AGAAAT C C T G T G T T AAAT AT T GCTATATT T TC T T AGC AGTTAT T T CTACAGT T 
AAT TAC AT AG T CAT GAT T G T T C TAC G T T T CATATAT TATAT GGT GC T T T GTATAT G C CAC TA 
AT AAAAT GAAT C T AAAC AT T GAAT G T GAAT GGCCCT C AGAAAAT CAT CT AGT GC AT T T AAAA 
AT AAT C GAC T C T AAAAC T G AAAG AAAC C T TAT CAC AT T T T C C C C AG T T CAAT GC TAT G C CAT 
TAC CAAC T C CAAAT AAT CT CAAAT AATT T T C C ACT TAATAAC T GT AAAG TTTTTTTCTGT T A 
AT T T AG GC AT AT AGAATAT T AAAT T C TGATAT T GCAC T T C T TAT T T TATAT AAAAT AAT C C T 
T TAAT AT C CAAAT GAAT CT G T TAAAAT GTTT GAT T C C TT GGGAAT GGC CT T AAAAATAAAT G 
TAATAAAGTCAGAGTGGTGGTATGAAAACATTCCTAGTGATCATGTAGTAAATGTAGGGTTA 
AGCAT GGAC AGC C AG AG CT T T C TAT GTACT G T TAAAAT T G AGG T CACATAT TT T C T T T T GTA 
TCCTGGCAAATACTCCTGCAGGCCAGGAAGTATAATAGCAAAAAGTTGAACAAAGATGAACT 
AATGTATTACATTACCATTGCCACTGATTTTTTTTAAATGGTAAATGACCTTGTATATAAAT 
AT TGC C AT AT CAT GG T ACC TAT AAT GGT GAT AT AT T T G T T T C TAT GAAAAATG T AT T GT GCT 
TTGATACTAAAAATCTGTAAAATGTTAGTTTTGGTAATTTTTTTTCTGCTGGTGGATTTACA 
TATTAAATTTTTTCTGCTGGTGGATAAACATTAAAATTAATCATGTTTCAAAAAAAAAAAAA 
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FIGUR E 147 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA45417 
<subunit 1 of I, 500 aa, 1 stop 
<MW: 56888, pi: 8.53, NX(S/T): 2 
MKCTAREWLRVTTVLFMARkIP 

QS ILDLHNKLRSQVYPTASNMEYMTWDVELERSAESWAJESCLWEHGPASLLPS IGQNLGAHW 
GRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGCAINLC 
HNMNIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYKEGSDRY 
YPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVI5AQQMSQIVSCEVRLRDQCKGTTCNR 
YE CPAGCLDS KAKVI GS VHYEMQS S I CRAAIHYG 1 1 DNDGGWVD I TRQGRKHYFI KSNRNGI 

QT IGKYQSANS FTVSKVTVQAVTCETTVEQLCPFHKPASHCPRVYCPRNCMQANPHYARVIG 
TRVYS DLS S I CRAAVHAGWRNHGGYVDVMPVDKRKT YI AS FQNG I FS E S L QNP P GGKAFRV 
FAW 

Important features: 
Signal peptide: 

amino acids 1-20 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 protein 

amino acids 165-186, 196-218, 134-146, 96-108 and 58-77 

N-glycosylation site 

amino acids 2 8-31 
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FIGUR E 148 



GCGGAGACAAGCGCAGAGCGCAGCGCACGGCCACAGACAGCCCTGGGCATCCACCGACGGCG 

CAGCCGGAGCCAGCAGAGCCGGAAGGCGCGCCCCGGGCAGAGAAAGCCGAGCAGAGCTGGGT 

GGCGTCTCCGGGCCGCCGCTCCGACGGGCCAGCGCCCTCCCCAISTCCCTGCTCCCACGCCG 

CGCCCCTCCGGTCAGCATGAGGCTCC7GGCGGCCGCGCTGCTCCTGCTGCTGCTGGCGCTGT 

ACACCGCGCGTGTGGACGGGTCCAAATGCAAGTGCTCCCGGAAGGGACCCAAGATCCGCTAC 

AGCGACGTGAAGAAGCTGGAAATGAAGCCAAAGTACCCGCACTGCGAGGAGAAGATGGTTAT 

CATCACCACCAAGAGCGTGTCCAGGTACCGAGGTCAGGAGCACTGCCTGCACCCCAAGCTGC 

AGAGCACCAAGCGCTTCATCAAGTGGTACAACGCCTGGAACGAGAAGCGCAGGGTCTACGAA 

GAAjEa^SGGTGAAAAACCTCAGAAGGGAAAACTCCAAACCAGTTGGGAGACTTGTGCAAAGGA 

CTTTGCAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAflAAAAAAAAAGCCTTTC 

TTTCTCACAGGCATAAGACACAAATTATATATTGTTATGAAGCACTTTTTACCAACGGTCAG 

TTTTTACATTTTATAGCTGCGTGCGAAAGGCTTCCAGATGGGAGACCCATCTCTCTTGTGCT 

CCAGACTTCATCACAGGCTGCTTTTTATCAAAAAGGGGAAAACTCATGCCTTTCCTTTTTAA 

AAAATGCTTTTTTGTATTTGTCCATACGTCACTATACATCTGAGCTTTATAAGCGCCCGGGA 

GGAACAATGAGCTTGGTGGACACATTTCATTGCAGTGTTGCTCCATTCCTAGCTTGGGAAGC 

TTCCGCTTAGAGGTCCTGGCGCCTCGGCACAGCTGCCACGGGCTCTCCTGGGCTTATGGCCG 

GTCACAGCCTCAGTGTGACTCCACAGTGGCCCCTGTAGCCGGGCAAGCAGGAGCAGGTCTCT 

CTGCATCTGTTCTCTGAGGAACTCAAGTTTGGTTGCCAGAAAAATGTGCTTCATTCCCCCCT 

GGTTAATTTTTACACACCCTAGGAAACATTTCCAAGATCCTGTGATGGCGAGACAAATGATC 

CTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCACAGGTTCAATA 

TTTAATGCTTCAGAAGCATGTGAGGTTCCCAACACTGTCAGCAAAAACCTTAGGAGAAAACT 

TAAAAATAT AT GAATAC AT G C GC AAT ACACAGC TACAGACACACAT T C T GT TGACAAGGGAA 

AACCTTCAAAGCATGTTTCTTTCCCTCACCACAACAGAACATGCAGTACTAAAGCAATATAT 

TTGTGATTCCCCATGTAATTCTTCAATGTTAAACAGTGCAGTCCTCTTTCGAAAGCTAAGAT 

GACCATGCGCCCTTTCCTCTGTACATATACCCTTAAGAACGCCCCCTCCACACACTGCCCCC 

CAGTATATGCCGCATTGTACTGCTGTGTTATATGCTATGTACATGTCAGAAACCATTAGCAT 

TGCATGCAGGTTTCATATTCTTTCTAAGATGGAAAGTAATAAAATATATTTGAAATGTAAAA 
AAAAAAAAAAA 




WO 99/46281 PCT/US99/05028 

FIGURE 149 

MSLLPRRAPPVSMRLIJ^lALL 

CEEKMVI I TTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYES 
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FIGURE 150 

GCCCCAGGGACTGCTATGGCTTCCTTTGTTGTTCACCCCGGTCTGCGTCa^TTAAACTCCA 
ATGTCCTCCTGTGGTTAA.CTGCTCTTGCCATCAAGTTCACCCTCATTGACAGCCAAGCACAG 
TAT C GAG T T G T C AAC AC AAATTAT GGCAAAAT C C GGGGC CTAAGAACAC C GT T AC C CAAT GA 
GAT C T T GGGT C C AG T G GAG C AGTAC TT AGGGGTC C CCTAT GC CT C AC CC C CCACT GGAGAGA 
GGCGGTTTCAGCCCCCAGAACCCCCGTCCTCCTGGACTGGCATCCGAAATACTACTCAGTTT 
GCTGCTGTGTGCCCCCAGCACCTGGATGAGAGATCCTTACTGCATGACATGCTGCCCATCTG 
G TT TAG C GC CAAT T T GGAT ACTTTGAT GACC TAT G TTCAAGAT CAAAATGAAGACT GCCTTT 
AC T TAAAC AT C TAG G T G C C C AC GGAAGAT GGAGC CAACACAAAGAAAAACGCAGAT GATATA 
AC GAGTAAT GAC C GT G G T GAAGACGAAGAT ATTCATGAT CAGAACAGTAAGAAGC CCGTCAT 
GGTCTATATCCATGGGGGATCTTACATGGAGGGCACCGGCAACATGATTGACGGCAGCATTT 
TGGCAAGCTACGGAAACGTCATCGTGATCACCATTAACTACCGTCTGGGAATACTAGGGTTT 
TTAAGTACCGGTGACCAGGCAGCAAAAGGCAACTATGGGCTCCTGGATCAGATTCAAGCACT 
GCGGTGGATTGAGGAGAATGTGGGAGCCTTTGGCGGGGACCCCAAGAGAGTGACCATCTTTG 
GCTCGGGGGCT GGGGC CTCCTGTGTCAGCCTGTTGACCCTGTCCCACTACTCAGAAGGTCTC 
T T C C AGAAGG C CAT CAT T C AGAGC G GC AC CGCCCTGTC C AGC T G GGCAGT GAACTAC CAGC C 
GGC CAAG TACAC T C GGAT AT TGGCAGACAAGGT CGGCTGCAACATGCTGGACACCACGGACA 
T GG T AGAAT G C C T G C G GAAC AAGAAC TACAAGGAGC TCAT C C AGC AGAC CAT CAC C C C GGC C 
AC C T AC C ACATAGC CTTCGGGCCGGT GAT C GAC GGC GAC G T CAT C C CAGAC GAC C C C C AGAT 
C C T GAT G GAGC AAG G C GAG T T C C T CAAC TAC GACAT CAT GCTGGGCGT CAAC CAAGGGGAAG 
GCCTGAAGTTCGTGGACGGCATCGTGGATAACGAGGACGGTGTGACGCCCAACGACTTTGAC 
TTCTCCGTGTC CAAC T T C G T GGACAACC TT TACGGCTAC C C TGAAGGGAAAGACACTTT GCG 
GGAGACT ATCAAGT T CAT GT ACACAGACTGGGCCGATAAGGAAAACCCGGAGACGCGGCGGA 
AAACCCTGGTGGCTCTCTTTACTGACCACCAGTGGGTGGCCCCCGCCGTGGCCGCCGACCTG 
CACGCGCAGTACGGCTCCCCCACCTACTTCTATGCCTTCTATCATCACTGCCAAAGCGAAAT 
GAAGCCCAGCTGGGCAGATTCGGCCCATGGTGATGAGGTCCCCTATGTCTTCGGCATCCCCA 
TGATCGGTCCCACCGAGCTCTTCAGTTGTAACTTTTCCAAGAACGACGTCATGCTCAGCGCC 
GTGGTCATGACCTACTGGACGAACTTCGCCAAAACTGGTGATCCAAATCAACCAGTTCCTCA 
G GAT AC CAAG T T CAT T C AC ACAAAAC C CAACC GC T T T GAAGAAGT GGCC T GG T C CAAGTATA 
AT C C C AAAG AC C AG CT C T AT C T GCAT AT T GG CT T GAAACC CAG AG T GAG AG AT CAC T ACCGG 
GCAACGAAAGTGGCTTTCTGGTTGGAACTCGTTCCTCATTTGCACAACTTGAACGAGATATT 
C CAG TAT GT T T C AAC AAC CAC AAAG GT T CCT C CAC CAGAC AT GACAT CAT T T C C CT AT GGC A 
CCCGGCGATCTCCCGCCAAGATATGGCCAACCACCAAACGCCCAGCAATCACTCCTGCCAAC 
AAT C C C AAACAC T C TAAGGAC CCTCACAAAACAGGGCCT GAGGACACAACTGT CCT CAT TGA 
AACCAAACGAGATTATTCCACCGAATTAAGTGTCACCATTGCCGTCGGGGCGTCGCTCCTCT 
TCCTCAACATCTTAGCTTTTGCGGCGCTGTACTACAAAAAGGACAAGAGGCGCCATGAGACT 
CACAGGCGCCCCAGTCCCCAGAGAAACACCACAAATGATATCGCTCACATCCAGAACGAAGA 
GATCAT G T C T C T G CAGAT GAAGCAGC T GGAACAC GATCAC GAGT GTGAGTCGC T GCAGGCAC 
AC GAC AC AC T G AGG C T CAC C T GCCC GCC AGACTAC ACC C T CACGC T GCG CCGG T C GC CAGAT 
GACAT C C CAC T TAT GAC G C CAAACAC CAT CAC CAT GAT T C CAAACACAC T GAC GGGGAT G CA 
GCCT T T GC ACACT T T T AAC ACCTTCAGT GGAGGACAAAACAGTACAAATTTAC CCCACGGAC 
ATTCCACCACTAGAGTAl^SCTTTGCCCTATTTCCCTTCCTATCCCTCTGCCCTACCCGCTC 
AGCAAC AT AGAAGAGG GAAG GAAAGAGAGAAGGAAAGAGAGAGAGAAAGAAAG T C T C CAGAC 
CAGGAATGTTTTTGTCCCACTGACTTAAGACAAAAATGCAAAAAGGCAGTCATCCCATCCCG 
GCAGAC C C TTAT CGTTGGTGTTTTC CAG TAT TACAAGAT CAAC TTCTGACCCT GT GAAATGT 
GAGAAGTACACATTTCTGTTAAAATAACTGCTTTAAGATCTCTACCACTCCAATCAATGTTT 
AGTGTGATAGGACATCACCATTTCAAGGCCCCGGGTGTTTCCAACGTCATGGAAGCAGCTGA 
CAC T T C T G AAAC T CAGC C AAGGACAC T T G AT ATT T TTTAAT T ACAAT GGAAGT TTAAACAT T 
TCTTTCTGTGC CAC AC AAT GGATGGC T CT C C T T AAGT GAAGAAAGAGT C AAT GAGAT T T T GC 
CCAGCACATGGAGCT GTAAT CCAGAGAGAAGGAAACGTAGAAATTTATTATTAAAAGAATGG 
ACTGTGCAGCGAAATCTGTACGGTTCTGTGCAAAGAGGTGTTTTGCCAGCCTGAACTATATT 
TAAGAGACTTTGT 
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FIGU R E 151 

MLNSNVLLWLTAIJVIKFTLIDSQAQYPVVNTNYGKIRGLRTPLPNEILGPVEQYLGVPYASP 

PTGERRFQPPEPPSSWTGIRNTTQFAAVCPQHLDERSLLHDMLPIWFTANLDTLMTYVQDQN 

EDCLYLNIYVPTEDGANTKKNADDITSNDRGEDEDIKDQNSKKPVMVYIHGGSYlffiGTGmi 

DGSILASYGNVIVITINYRLGILGFLSTGDQAAKGNYGLLDQIQALRWIEENVGAFGGDPKR 
VTIFGSGAGASCVSLLTLSHYSEGLFQKAIIQSGTALSS 

DTTDMVECLRNKNYKEL I QQT I TPATYHIAFGPVI DGDVIPDDPQILMEQGEFLNYDIMLGV 
NQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIK5MYTDWADKENP 
ETRRKTLVALFTDHQWVAPAVAADLHAQYGSPTYFYAFYHHCQSEMKPSWADSAHGDEVPYV 
FGIPMIGPTELFSCNFSKNDVMLSAWMTYWTNFAKTGDPNQPVPQDTKFIHTKPNRFEEVA 
WSKYNPKDQLYLHIGLKPRVRDHYRATKVAFWLELVPHLHNLNEIFQYVSTTTKVPPPDMTS 
FPYGTRRSPAKIWPTTKRPAITPANNPKHSKDPHKTGPEDTTVLIETKRDYSTELSVTIAVG 
ASLLFLNILAFAALYYKKDKRRHETHRRPSPQRNTTNDIAHIQNEEIMSLQMKQLEHDHECE 

SLQAHDTLRLTCPPBYTLTLRRSPDDIPLMTPNTITMIPNTLTGMQPLHTFNTFSGGQNSTN 
LPHGHSTTRV 



WO 99/46281 



PCT/US99/05028 



FIGURE 1 52 

GGGAAAG&ISGCGGCGACTCTGGGACCCCTTGGGTCGTGGCAGCAGTGGCGGCGATGTTTGT 
CGGCTCGGGATGGGTCCAGGATGTTACTCCTTCTTCTTTTGTTGGGGTCTGGGCAGGGGCCA 
CAGCAAGTCGGGGCGGGTCAAACGTTCGAGTACTTGAAACGGGAGCACTCGCTGTCGAAGCC 
CTACCAGGGTGTGGGCACAGGCAGTTCCTCACTGTGGAATCTGATGGGCAATGCCATGGTGA 
TGACCCAGTATATCCGCCTTACCCCAGATATGCAAAGTAAACAGGGTGCCTTGTGGAACCGG 
GTGCCATGTTTCCTGAGAGACTGGGAGTTGCAGGTGCACTTCAAAATCCATGGACAAGGAAA 
GAAGAATCTGCATGGGGATGGCTTGGCAATCTGGTACACAAAGGATCGGATGCAGCCAGGGC 
CTGTGTTTGGAAACATGGACAAATTTGTGGGGCTGGGAGTATTTGTAGACACCTACCCCAAT 
GAGGAGAAGCAGCAAGAGCGGGTATTCCCCTACATCTCAGCCATGGTGAACAACGGCTCCCT 
CAGCTATGATCATGAGCGGGATGGGCGGCCTACAGAGCTGGGAGGCTGCACAGCCATTGTCC 
GCAATCTTCATTACGACACCTTCCTGGTGATTCGCTACGTCAAGAGGCATTTGACGATAATG 
ATGGATATTGATGGCAAGCATGAGTGGAGGGACTGCATTGAAGTGCCCGGAGTCCGCCTGCC 
CCGCGGCTACTACTTCGGCACCTCCTCCATCACTGGGGATCTCTCAGATAATCATGATGTCA 
TTTCCTTGAAGTTGTTTGAACTGACAGTGGAGAGAACCCCAGAAGAGGAAAAGCTCCATCGA 
GATGTGTTCTTGCCCTCAGTGGACAATATGAAGCTGCCTGAGATGACAGCTCCACTGCCGCC 
CCTGAGTGGCCTGGCCCTCTTCCTCATCGTCTTTTTCTCCCTGGTGTTTTCTGTATTTGCCA 

TAGTCATTGGTATCATACTCTAGAACAAA.TGGCAGGAACAGAGCCGAAA.GCGCTTCTACIS& 
GCCCTCCTGCTGCCACCACTTTTGTGACTGTCACCCATGAGGTATGGAAGGAGCAGGCACTG 
GCCTGAGCATGCAGCCTGGAGAGTGTTCTTGTCTCTAGCAGCTGGTTGGGGACTATATTCTG 
TCACTGGAGTTTTGAATGCAGGGACCCCGCATTCCCATGGTTGTGCATGGGGACATCTAACT 
CTGGTCTGGGAAGCCACCCACCCCAGGGCAATGCTGCTGTGATGTGCCTTTCCCTGCAGTCC 
TTCCATGTGGGAGCAGAGGTGTGAAGAGAATTTACGTGGTTGTGATGCCAAAATCACAGAAC 
AGAATTTCATAGCCCAGGCTGCCGTGTTGTTTGACTCAGAAGGCCCTTCTACTTCAGTTTTG 
AATCCACAAAGAATTAAAAACTGGTAACACCACAGGCTTTCTGACCATCCATTCGTTGGGTT 
TTGCATTTGACCCAACCCTCTGCCTACCTGAGGAGCTTTCTTTGGAAACCAGGATGGAAACT 
TCTTCCCTGCCTTACCTTCCTTTCACTCCATTCATTGTCCTCTCTGTGTGCAACCTGAGCTG 
GGAAAGGCATTTGGATGCCTCTCTGTTGGGGCCTGGGGCTGCAGAACACACCTGCGTTTCAC 
TGGCCTTCATTAGGTGGCCCTAGGGAGATGGCTTTCTGCTTTGGATCACTGTTCCCTAGCAT 
GGGTCTTGGGTCTATTGGCATGTCCATGGCCTTCCCAATCAAGTCTCTTCAGGCCCTCAGTG 
AAGTTTGGCTAAAGGTTGGTGTAAAAATCAAGAGAAGCCTGGAAGACATCATGGATGCCATG 
GATTAGCTGTGCAACTGACCAGCTCCAGGTTTGATCAAACCAAAAGCAACATTTGTCATGTG 
GTCTGACCATGTGGAGATGTTTCTGGACTTGCTAGAGCCTGCTTAGCTGCATGTTTTGTAGT 
TACGATTTTTGGAATCCCACTTTGAGTGCTGAAAGTGTAAGGAAGCTTTCTTCTTACACCTT 
GGGCTTGGATATTGCCCAGAGAAGAAATTTGGCTTTTTTTTTCTTAATGGACAAGAGACAGT 
TGCTGTTCTCATGTTCCAAGTCTGAGAGCAACAGACCCTCATCATCTGTGCCTGGAAGAGTT 
CACTGTCATTGAGCAGCACAGCCTGAGTGCTGGCCTCTGTCAACCCTTATTCCACTGCCTTA 
TTTGACAAGGGGTTACATGCTGCTCACCTTACTGCCCTGGGATTAAATCAGTTACAGGCCAG 
AGTCTCCTTGGAGGGCCTGGAACTCTGAGTCCTCCTATGAACCTCTGTAGGCTAAATGAAAT 

TCTTAAAATCACCGATGGAACCAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 
ACCTGCAGTAGGGATAACAGGGTAATAAGCTTGGCCGCCATGG 
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></usr/seqdb2/sst/DNA/Dnaseqs.min/ss,DNA50911 
Xsubunit 1 of 1, 348 aa, 1 stop 
XMW: 39711, pi: 8.70, NX{S/T): 1 

MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGFQQVGAGQTFEYLKREHSLSKPYQ 
GVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDW 

LHGDGLAIWYTKDRMQPGPVFGNMDKFVGL^^ 

DHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMDIDGXHEWRDCIEVPGVRLPRG 
YYFGTS S I TGDLS DNHDVI S LKLFELTVERTPEEEKLHRDVFLPS VDNMKLPEMTAPLPPLS 
GLAL FL IVFFS LVFS VFAI VI G 1 1 L YNKWQEQSRKRFY 
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WO 99/46281 



PC7/US99/05028 



FIGUR E 154 

CCGAGCCGGCCGCGCAGCGACGGAGCTGGGGCCGGCCTGGGACCATGGGCGTGAGTGCAATC 

TACGGATCAGTCTCTGATGGTGGGTCGTTAACCTCAGTGGGGACTCCAAGATTTCC^TGAAG 
AAAAT CAG T T G T C T T CAT T C AAGAAT TGGGGTCTGGCT CAGAAT T C C T GCAGC T G G TGAAAA 

TCTGTTTTCTAGAAGAGGTTTAATTAATGCCTGCAGTCTGACATGTTCCCGATTTGAGGTGA 
AA.C CAT GAAGAG AAAAT AGAATAC T T AATAAT£C T T T T C C GCAAC C GCT T C TTGC T GC T GC T 

GGCCCTGGCTGCGCTGCTGGCCTTTGTGAGCCTCAGCCTGCAGTTCTTCCACCTGA^CCCGG 
T GT C GAC T C C TAAGAAT GGAATGAGTAGCAAGAGTC GAAAGAGAATCATGCCC GAC CCT GT G 
ACGGAGCCCCCTGTGACAGACCCCGTTTATGAAGCTCTTTTGTACTGCAACATCCCCAGTGT 
GGCCGAGCGCAGCATGGAAGGTCATGCCCCGCATCATTTTAAGCTGGTCTCAGTGCATGTGT 
TCATTCGCCACGGAGACAGGTACCCACTGTATGTCATTCCCAAAACAAAGCGACCAGAAATT 
GACTGCACTCTGGTGGCTAACAGGAAACCGTATCACCCAAAACTGGAAGCTTTCATTAGTCA 

CATGTCAAAA.GGATCCGGAGCCTCTTTCGAAAGCCCCTTGAACTCCTTGCCTCTTTACCCAA 
AT C AC C CAT T G T G T G AGAT G G GAGAGC T C AC AC AGAC AGGAGT T GT GCAGCAT TT GCAGAAC 

GGTCAGCTGCTGAGGGATATCTATCTAAAGAAACACAAACTCCTGCCCAATGATTGGTCTGC 
AGAC CAG C T C TAT T T AGAG AC C AC T GGGAAAAGC C G GAC C C T ACAAAGTGGGC T GGCCTTGC 
T T TAT GGCTTTCTCC CAG AT T T T GAC T GGAAGAAGATT T AT T T CAGGCACCAGC CAAGT GC G 
CTGTTCTGCTCTG G AAG C T G C TAT T GC CCG GTAAGAAAC C AGTATC T GGAAAAGGAGC AGC G 
T C G T CAG T AC C T C C T AC G T T T GAAAAACAG C CAGC T G GAGAAGAC C TAC GGGGAGAT GG C CA 
AGATCGTGGATGTCCCCACCAAGCAGCTTAGAGCTGCCAACCCCATAGACTCCATGCTCTGC 
CACTTCTGCCACAATGTCAGCTTTCCCTGTACCAGAAATGGCTGTGTTGACATGGAGCACTT 
CAAGGTAATTAAGACCCATCAGATCGAGGATGAAAGGGAAAGACGGGAGAAGAAATTGTACT 
TCGGGTATTCTCTCCTGGGTGCCCACCCCATCCTGAACCAAACCATCGGCCGGATGCAGCGT 
GCCACCGAGGGCAGGAAAGAAGAGCTCTTTGCCCTCTACTCTGCTCATGATGTCACTCTGTC 
ACCAGTTCTCAGTGCCTTGGGCCTTTCAGAAGCCAGGTTCCCAAGGTTTGCAGCCAGGTTGA 
T C T T T GAG C T T T GG C AAGAC AGAGAAAAGC C CAGT GAAC AT T C C GT C C GGAT T C T T TAC AAT 
GGC G T C GAT G T CACAT T C CACACCTC TTTC TGC CAAGAC CACCACAAGCGTT CT CCCAAGCC 
CATGTGCCCGCTTGAAAACTTGGTCCGCTTTGTGAAAAGGGACATGTTTGTAGCCCTGGGTG 
GC AG T G G T AC AAAT TAT TAT GAT GCAT GT CAC AGGGAAGGAT T C I&AAAGGTAT GCAG T ACA 
G CAG T AT AGAAT C CAT G C C AAT ACAGAGCAT AG G GAAAGG TC CAC T T CT AGT TTTGTCTGTT 
ACTAAGGGTAGAAGATTATTGCTTTTTAAAGGCTAAATATTGTTTGTGGGAACCACAGATGG 
TTGGGGTTGAACAGTAAGCACATTGCTGCAATGTGGTACGTGAATTGCTTGGTACAAAA.TGG 
CCAGTTCACAGAGGAATAGAAGGTACTTTATCATAGCCAGACTTCGCTTAGAATGCCAGAAT 
AATATAGTTCAAGACCTGAAGTTGCCAATCCAAGTTTGCACTCTTCTGGCCTGCCCCATGTT 
AC TAT G T GAT G GAAC CAG C ACAC C T CAAC CAAAA.T T T T T T T AATC TTAGACAT T T T TAC C T T 

GTCCTTGTTAAGAATTTCTTGAAGTGATTTATCTAAAATAAAGGTTGGCAAACTTTTTCTGT 
AAAG GGC CAG AT T G T AAAT AT T T CAGAC T G T G T GGAC C AAAAGGC C ACATACAG TCTCTGTC 
ATAAC TAC TCAACTC TGTTT CTGAAGCAGGAAAGCCACCACAGACAGTACATAAA.GGAATAT 
GTGTAGCTGGGTTCCCAGGCCAGACAAAACAGATGGTGACCAGACTTGGCCCCTGGGCTGTA 
GT T T GC T GAC C C C T CAT C T AAAAAATAGGC TAT AC T ACAAT TGCACT T C CAGCAC T T T GAGA 
AC GAG T T G AAT AC C AAGAAT TAT TCAAT GGT T C C T C C AGTAAC T T C T GCT AGAAAC ACAGAA 
TTTGGTCTGTATCTGACACTAGAAGAAAACTTGAGGGTAAA.TAAACATTGAATTAGAATGAA 
TCATAGAAAACTGATTAGAAGAATACTTGATGTTTATGATGATTGTGGTACAAGATAGTTTT 
AAGTATGTTCTAAATATTTGTCTGCTGTAGTCTATTTGCTGTATATGCTGAAATTTTTGTAT 
GCCATTTAGTATTTTTATAGTTTAGGAAAATATTTTCTAAGACCAGTTTTAGATGACTCTTA 
TTCCTGTAGTAATATTCAATTTGCTGTACCTGCTTGGTGGTTAGAAGGAGGCTAGAAGATGA 
ATTCAGGCACTTTCTTCCAATAAAACTAATTATGGCTCATTCCCTTTGACAAGCTG^AGAAC 
TGGATTCATTTTTAAACCATTTTCATCAGTTTCAAATGGTAAATTCTGATTGATTTTTAAAT 
GCGTTTTTG GAAGAAC T T T G C TAT TAGGTAGT T T ACAGAT C T T TATAAGGT GT T T T ATATAT 
T AGAAGC AAT T A T AAT TAC AT C T G T GAT T T C T GAAC T AAT GGT GC TAAT T C AGAGAAAT GGA 

AAGTGAAAGTGAGATTCTCTGTTGTCATCGGCATTCCAACTTTTTCTCTTTGTTTTTGTCCA 
G T GT T GCAT T T GAAT AT GTCTGTTTC TATAAA.TAAAT T T T T TAAGAATAA 
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FIGURE 155 

></usr/seqdb2/sst/DNA/Dnaseqs,min/5S.DNA48329 
xsubunit 1 of 1, 480 aa, 1 stop 
><MW: 55240, pi: 9,30, NX(S/T): 2 
MLFRNRFLLLLALA^ 

EALLYCNIPSVAERSMEGHAPHHFKLVSVKVFIRHGDRYPLTVIPKTKRPEIDCTLVANRK? 

YHPKLEAFISHMSKGSGAS FESPLNSLPLYPNHPLCEMGELTQTGWQHLQNGQLLRDIYLK 

KHKLLPNDWSADQLYLETTGKSRTLQSGLALLYGFLPDFDWKKIYFRHQPSALFCSGSCYCP 

VRNQYLEKEQRRQYLLRLKNSQLEKTYGEMAKIVDVPTKQLRAANPIDSMLCHFCHNVSFPC 
TRNGCVDMEHFKVTKTHQ I EDERERREKKL YFGYSLLGAHP I LNQT I GRMQRATEGRKEEL F 

ALYSAHDVTLSPVLSALGLSEARFPRFAARLIFELWQDREKPSEHSVRILYNGVDVTFHTSF 
CQDHHKRSPKPMCPLENLVRFVKRDMFVALGGSGTNYYDACHREGF 
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FIGURE 156 

AAAAAAG C T C AC T AAAG T T T C TAT TAGAGC GAAT AC GGTAGAT T T CCATC C CC T T TT GAAGA 

ACAGTACTGTGGAGCTATTTAAGAGATAAAAACGAAATATCCTTTCTGGGAGTTCAAGATTG 
TGCAGTAATTGGTTAGGACTCTGAGCGCCGCTGTTCACCAATCGGGGAGAGAAAAGCGGAGA 
TCCTGCTCGCCTTGCACGCGCCTGAAGCACAAAGCAGATAGCTAGGAATGAACCATCCCTGG 
GAGTATGTGGAAACAACGGAGGAGCTCTGACTTCCCLAACTGTCCCATTCTATGGGCGAAGGA 
AC T GC T C C T GAC T T C AG T GG T TAAGGGCAGAAT T GAAAATAAT T C T GGAGGAAGATAAGA&T 

SATTCCTGCGCGACTGCACCGGGACTACAAAGGGCTTGTCCTGCTGGGAATCCTCCTGGGGA 
C T C T G T GGGAGAC C G GAT G C AC C CAGATAC GC TAT T C AGT T C C GGAAGAG C T GGAGAAAGGC 

TCTAGGGTGGGCGACATCTCCAGGGACCTGGGGCTGGAGCCCCGGGAGCTCGCGGAGCGCGG 
AGTCCGCATCATCCCCAGAGGTAGGACGCAGCTTTTCGCCCTGAATCCGCGCAGCGGCAGCT 
TGGTCACGGCGGGCAGGATAGACCGGGAGGAGCTCTGTATGGGGGCCATCAAGTGTCAATTA 
AATCTAGACATTCTGATGGAGGATAAAGTGAAAATATATGGAGTAGAAGTAGAAGTAAGGGA 
CAT T AAC G AC AAT G C G C C T TAG T T T C G T GAAAG T GAAT T AGAAATAAAAAT T AGT GAAAAT G 

CAGCCACTGAGATGCGGTTCCCTCTACCCCACGCCTGGGATCCGGATATCGGGAAGAACTCT 
CTGCAGAGCTACGAGCTCAGCCCGAACACTCACTTCTCCCTCATCGTGCAAAATGGAGCCGA 
CGGTAGTAAGTACCCCGAATTGGTGCTGAAACGCGCCCTGGACCGCGAAGAAAAGGCTGCTC 
ACCACCTGGTCCTTACGGCCTCCGACGGGGGCGACCCGGTGCGCACAGGCACCGCGCGCATC 
CGCGTGATGGTTCTGGATGCGAACGACAACGCACCAGCGTTTGCTCAGCCCGAGTACCGCGC 
GAGCGTTCCGGAGAATCTGGCCTTGGGCACGCAGCTGCTTGTAGTCAACGCTACCGACCCTG 
AC GAAGGAG T C AAT G C G GAAG T GAGGT AT T C C T T C C GGT AT G T GGAC GACAAGGC GGC C CAA 
G T T T T CAAAC TAG AT T G T AAT T CAGGGACAATAT C AAC AAT AGGGGAGT T GGAC CAC GAGGA 
GTCAGGATTCTACCAGATGGAAGTGCAAGCAATGGATAATGCAGGATATTCTGCGCGAGCCA 
AAGTCCTGATCACTGTTCTGGACGTGAACGACAATGCCCCAGAAGTGGTCCTCACCTCTCTC 
GC C AGC TCGGTTCCC GAAAAC T CT C C CAGAGGGACAT TAAT TGCCCTTT TAAAT GTAAATGA 

CCAAGATTCTGAGGAAAACGGACAGGTGATCTGTTTCATCCAAGGAAATCTGCCCTTTAAAT 

TAGAAAAATCTTACGGAAATTACTATAGTTTAGTCACAGACATAGTCTTGGATAGGGAACAG 

GTTCCTAGCTACAACATCACAGTGACCGCCACTGACCGGGGAACCCCGCCCCTATCCACGGA 

AACTCATATCTCGCTGAACGTGGCAGACACCAACGACAACCCGCCGGTCTTCCCTCAGGCCT 

CCTATTCCGCTTATATCCCAGAGAACAATCCCAGAGGAGTTTCCCTCGTCTCTGTGACCGCC 

CACGACCCCGACTGTGAAGAGAACGCCCAGATCACTTATTCCCTGGCTGAGAACACCATCCA 

AGGGGCAAGCCTATCGTCCTACGTGTCCATCAACTCCGACACTGGGGTACTGTATGCGCTGA 

GCTCCTTCGACTACGAGCAGTTCCGAGACTTGCAAGTGAAAGTGATGGCGCGGGACAACGGG 

CACCCGCCCCTCAGCAGCAACGTGTCGTTGAGCCTGTTCGTGCTGGACCAGAACGACAATGC 

GCCCGAGATCCTGTACCCCGCCCTCCCCACGGACGGTTCCACTGGCGTGGAGCTGGCTCCCC 
G C T C C GCAGAGC C C G G C T AC C T GGT GAC CAAGG T GG T GGC GGT GGAC AGAGAC T C C G G C C AG 

AACGCCTGGCTGTCCTACCGTCTGCTCAAGGCCAGCGAGCCGGGACTCTTCTCGGTGGGTCT 
GCACACGGGCGAGGTGCGCACGGCGCGAGCCCTGCTGGACAGAGACGCGCTCAAGCAGAGCC 
TCGTAGTGGCCGTCCAGGACCACGGCCAGCCCCCTCTCTCCGCCACTGTCACGCTCACCGTG 
GCCGTGGCCGACAGCATCCCCCAAGTCCTGGCGGACCTCGGCAGCCTCGAGTCTCCAGCTAA 
CTCTGAAACCTCAGACCTCACTCTGTACCTGGTGGTAGCGGTGGCCGCGGTCTCCTGCGTCT 
TCCTGGCCTTCGTCATCTTGCTGCTGGCGCTCAGGCTGCGGCGCTGGCACAAGTCACGCCTG 
CTGCAGGCTTCAGGAGGCGGCTTGACAGGAGCGCCGGCGTCGCACTTTGTGGGCGTGGACGG 
GGTGCAGGCTTTCCTGCAGACCTATTCCCACGAGGTTTCCCTCACCACGGACTCGCGGAAGA 
G T CAC C T GAT C T T C C C C C AG C C CAAC TAT GC AGAC AT GC T C GT CAGC C AGGAGAGC T T T GAA 

AAAAGCGAGCCCCTTTTGCTGTCAGGTGATTCGGTATTTTCTAAAGACAGTCATGGGTTAAT 
TGAGGTGAGTTTATATCAAATCTTCTTTCTTTTTTTTTTTAATTGCTCTGTCTCCCAAGCTG 
GAGTGCAGCGGTACGATCATAGCTCACTGCGGCCTCAAACTCCTAGGCTCAAGCAATTATCC 
CACCTTTGCCTCCGGTGTAACAGGGACTACAGGTGCAAGCCACCTACTGTCTGCCTATCTAT 
C TAT C TAT C TAT C TAT C TAT C TAT C TAT C TAT C TAT C TAT C T ATTAC T T T C T T GT AC AGAC G 

GGAGTCTCACGCCTG TAAT CCCAGTACTTTGGGAGGCCGAGGCGGGTGGATCACCT GAGGT T 

GGGAGTTTGAGACCAGCCTgACCAACATGGAGAAACCCCGTCTATACTAAAAAAATACAAAA 

TTAGCCGGGCGTGGTGGTGCATGTCTGTAATCCCAGCTACTTGGGAGGCTGAGTCAGGAGAA 

TTGCTTTAACCTGGGAGGTGGAGGTTGCAATGAGCTGAGATTGTGCCATTGCACTCCAGCCT 
GGGCAACAAGAGTGAAACTCTATCTCA 
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FIGURE 1 57 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA48306 
xsubunit 1 of 1, 916 aa, 1 srop 
XMW: 100204, pi: 4*92, NX (S/T) : 4 

MIPARLHRDYKGLVLLGILLGTLWETGCTQIRYSVPEELEKGSRVGDISRDLGLEPRELAZR 
GVRIIPRGRTQLFALNPRSGSLVTAGRIDREELCMGAIKCQLNLDILMEDKVKIYGVEVETO 
DINDNAPYFRESELEIKISENAATEMRFPLPHAWDPDIGKNSLQSYELSPNTHFSLIVQNGA 
DGS KY PE L VLKRALDREEKAAHHLVLTAS DGGDPVRT GTAR I RVMVLDANDNAPAFAQPE YR 
AS VP ENLAL GTQL LWNAT DPDE GVNAE VRYS FRYVDDKAAQVFKLDCNS GTISTI GELDHE 
E SG FYQME VQAMDNAG YS ARAKVL I T VLDVNDNAPEWLT S LAS SVPENS PRGTL I ALLNVN 

DQDSEENGQVICFIQGNLPFKLEKSYGNYYSLVTDIVLDREQVPSYNITVTATDRGTPPLST 
ETHIS LNVADTNDNP P VFP QASYSAY I PENNPRGVS LVS VTAHDPDCEENAQ I TYS LAENT I 

QGASLSSYVSINSDTGVLYALSSFDYEQFRDLQVKVMARDNGHPPLSSWSLSLFVLDQNDN 
APEILYPALPTDGSTGVELAPRSAEPGYLVTKWAVDRDSGQNAWLSYRLLKASEPGLFSVG 
LHTGEVRTARALLDRDALKQSLWAVQDHGQPPLSATVTLTVAVADSIPQVLADLGSLESPA 
NSETSDLTLYLWAVAAVSCVFIAFVILLIALRLRRWHKSRLLQASGGGLTGAPASHFVGVD 
GVQAFLQTYSHEVS'LTTDSRKSHLIFPQPNYADMLVSQESFEKSEPLLLSGDSVFSKDSHGL 
IEVSLYQIFFLFFFNCSVSQAGVQRYDHSSLRPQTPRLKQLSHLCLRCNRDYRCKPPTVCLS 
IYLSIYLSIYLSIYLLLSCTDGSLTPVIPVLWEAEAGGSPEVGSLRPA 



WO 99/46281 PCT/US99/05028 

FIGUR E 15 8 

CCCAGGCTCTAGTGCAGGAGGAGAAGGAGGAGGAGCAGGAGGTGGAGATTCCCAGTTAAAAG 

GCTCCAGAATCGTGTACCAGGCAGAGAACTGAAGTACTGGGGCCTCCTCCACTGGGTCCGAA 

TCAGTAGGTGACCCCGCCCCTGGATTCTGGAAGACCTCACCATSGGACGCCCCCGACCTCGT 

GCGGCCAAGACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACACTCCAGGGC 

ACAGGAGGACAAGGTGCTGGGGGGTCATGAGTGCCAACCCCATTCGCAGCCTTGGCAGGCGG 

CCTTGTTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCCTT 

ACAGC TGCCCACT GTAAAAAA.CCGAAATACACAGTACGCCTGGGAGACCACAGCCTACAGAA 
T AAAGAT GG C C C AGAGC AAGAAAT AC C TGT GG TT CAG T C CAT C C C ACAC C C C T GC TACAACA 

GCAGCGATGTGGAGGACCACAACCATGATCTGATGCTTCTTCAACTGCGTGACCAGGCATCC 
CTGGGGTCCAAAGTGAAGCCCATCAGCCTGGCAGATCATTGCACCCAGCCTGGCCAGAAGTG 
CACCGTCTCAGGCTGGGGCACTGTCACCAGTCCCCGAGAGAATTTTCCTGACACTCTCAACT 
GTGCAGAAGTAAAAATCTTTCCCCAGAAGAAGTGTGAGGATGCTTACCCGGGGCAGATCACA 
GATGGCATGGTCTGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCCAGGGCGATTCTGGAGG 
CCCCCTGGTGTGTGATGGTGCACTCCAGGGCATCACATCCTGGGGCTCAGACCCCTGTGGGA 
GGTCCGACAAACCTGGCGTCTATACCAACATCTGCCGCTACCTGGACTGGATCAAGAAGATC 
AT AG GC AGC AAGG G C T£&T T CT AGGATAAGCACTAGAT CT CC C T T AATAAAC T CACAACT CT 
CTGGTTC 
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F IG UR E 15 9 



</usr/seqdb2/sst/DNA/Dnaseqs*min/ss .DNA48336 
<subunit 1 of 1, 2 60 aa, 1 stop 
<MW: 28048, pi; 7,87, NX(3/T): 1 

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGGVL 
VGGNWVL TAAHCKKPKYT VRLGDHS LQNKDGPE QE I PWQS I ?HPC YNS S DVEDHNHDLMLL 

QLRDQASLGSKVKP I SLADKCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKI FPQKKCED 

AYPGQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPGVYTNICRY 
LDWIKKIIGSKG 

Important Features: 
Signal peptide: 
amino acids 1-23 

Transmembrane domain: 

amino acids 51-71 

N-glycosylation site, 
amino acids 110-113 

Serine proteases, trypsin family, histidine active site, 
amino acids 69-74 and 207-217 

Tyrosine kinase phosphorylation site, 
amino acids 182-188 

Kringle domain proteins motif 
amino acids 205-217 
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F IG UR E 16 0 



GGCGCCGGTGCACCGGGCGGGCTGAGCGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCCGC 

'CGCGCCGCCCACGCCCCAACCCCGGCCCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGCCCGC 

GCCCGCGCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCCCGCCCCGGCCCGCCCCCGCCCCG 

CCCCGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCCCATAAAAC 

ATTCATCCTCCCGGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCCGCCGCCCTCG 

CCCTGTGCGCCCTGCGCGCCCTGCGCACCCGCGGCCCGAGCCCAGCCAGAGCCGGGCGGaGC 

GGAGCGCGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAGCGGCGGCGCCTGGA 

TGCGGACCCGGCCGCGGGGAGACGGGCGCCCGCCCCGAAACGACTTTCAGTCCCCGACGCGC 

CCCGCCCAACCCCTACGMSA?VGAGGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTG 

CTGTGGCTGCAGGCCTGGCAGGTGGCAGCCCCATGCCCAGGTGCCTGCGTATGCTACAATGA 

GCCCAAGGTGACGACAAGCTGCCCCCAGCAGGGCCTGCAGGCTGTGCCCGTGGGCATCCCTG 

CTGCCAGCCAGCGCATCTTCCTGCACGGCAACCGCATCTCGCATGTGCCAGCTGCCAGCTTC 

CGTGCCTGCCGCAACCTCACCATCCTGTGGCTGCACTCGAATGTGCTGGCCCGAATTGATGC 

GGCTGCCTTCACTGGCCTGGCCCTCCTGGAGCAGCTGGACCTCAGCGATAATGCACAGCTCC 

GGTCTGTGGACCCTGCCACATTCCACGGCCTGGGCCGCCTACACACGCTGCACCTGGACCGC 

TGCGGCCTGCAGGAGCTGGGCCCGGGGCTGTTCCGCGGCCTGGCTGCCCTGCAGTACCTCTA 

CCTGCAGGACAACGCGCTGCAGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCA 

CACACCTCTTCCTGCACGGCAACCGCATCTCCAGCGTGCCCGAGCGCGCCTTCCGTGGGCTG 

CACAGCCTCGACCGTCTCCTACTGCACCAGAACCGCGTGGCCCATGTGCACCCGCATGCCTT 

CCGTGACCTTGGCCGCCTCATGACACTCTATCTGTTTGCCAACAATCTATCAGCGCTGCCCA 

CTGAGGCCCTGGCCCCCCTGCGTGCCCTGCAGTACCTGAGGCTCAACGACAACCCCTGGGTG 

TGTGACTGCCGGGCACGCCCACTCTGGGCCTGGCTGCAGAAGTTCCGCGGCTCCTCCTCCGA 

GGTGCCCTGCAGCCTCCCGCAACGCCTGGCTGGCCGTGACCTCAAACGCCTAGCTGCCAATG 

ACCTGCAGGGCTGCGCTGTGGCCACCGGCCCTTACCATCCCATCTGGACCGGCAGGGCCACC 

GATGAGGAGCCGCTGGGGCTTCCCAAGTGCTGCCAGCCAGATGCCGCTGACAAGGCCTCAGT 

ACTGGAGCCTGGAAGACCAGCTTCGGCAGGCAATGCGCTGAAGGGACGCGTGCCGCCCGGTG 

ACAGCCCGCCGGGCAACGGCTCTGGCCCACGGCACATCAATGACTCACCCTTTGGGACTCTG 

CCTGGCTCTGCTGAGCCCCCGCTCACTGCAGTGCGGCCCGAGGGCTCCGAGCCACCAGGGTT 

CCCCACCTCGGGCCCTCGCCGGAGGCCAGGCTGTTCACGCAAGAACCGCACCCGCAGCCACT 

GCCGTCTGGGCCAGGCAGGCAGCGGGGGTGGCGGGACTGGTGACTCAGAAGGCTCAGGTGCC 

CTACCCAGCCTCACCTGCAGCCTCACCCCCCTGGGCCTGGCGCTGGTGCTGTGGACAGTGCT 

TGGGCCCTGCIS^CCCCCAGCGGACACAAGAGCGTGCTCAGCAGCCAGGTGTGTGTACATAC 

GGGGTCTCTCTCCACGCCGCCAAGCCAGCCGGGCGGCCGACCCGTGGGGCAGGCCAGGCCAG 

GTCCTCCCTGATGGACGCCTGCCGCCCGCCACCCCCATCTCCACCCCATCATGTTTACAGGG 

TTCGGCGGCAGCGTTTGTTCCAGAACGCCGCCTCCCACCCAGATCGCGGTATATAGAGATAT 

GCATTTTATTTTACTTGTGTAAAAATATCGGACGACGTGGAATAAAGAGCTCTTTTCTTAAA 
AAAA 
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FIGURE 161 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss -DNA44184 
Xsubunit 1 of 1, 473 aa, 1 stop 
XMW: 50708, pi: 9.23, NX(S/T): 6 

MKI^SAGGSRLLAWVLWLQAWQVAA.PCPGACYCYNEPKVTTSCPQQGLQAV?VGI?AASQRI 
FLHGNRI SHVPAAS FRACRNLTILWLHS^/LARIDAAAFTGLALLSQLDLSDNAQLRS^/DPA 

TFHGLGRLHTLHLDRCGLQELGPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNRISSVPERAFRGLHSLDRLLLHQNRVAHVHPH^^ 

LRALQYLRLNDNPWCDCRARPLWAWLQKFRGSSSE^^ 

VATGPYHPIWTGRATDEEPLGLPKCCQPDAADKASVLEPGRPASAGNALKGRVPPGDSPPGN 

GSGPRHINDSPFGTLPGSAEPPLTAVRPEGSEPPGFPTSGPRRRPGCSRKNRTRSHCRLGQA 
GSGGGGTGDSEGSGALPSLTCSLTPLGLALVLWTVLGPC 

Important features: 
Signal peptide: 

amino acids 1-2 6 

Leucine zipper pattern. 

amino acids 135-156 

Glycosaminoglycan attachment site. 

amino acids 436-439 

N-glycosylation site. 

amino acids 82-85, 179-183, 237-240, 372-375 and 423-426 

VWFC domain 

amino acids 411-425 
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F IG UR E 16 2 

GGAAGTCCACGGGGAGCTTGGATGCCAAAGGGAGGACGGCTGGGTCCTCTGGAGAGGACTAC 
TCACTGGCATATTTCTGAGGTATCTGTAGAATAACCACAGCCTCAGATACTGGGGACTTTAC 
AGTCCCACAGAACCGTCCTCCCAGGAAGCTGAATCCAGCAAGAACAA2SGAGGCCAGCGGGA 
AGCTCATTTGCAGACAAAGGCAAGTCCTTTTTTCCTTTCTCCTTTTGGGCTTATCTCTGGCG 
GGCGCGGCGGAACCTAGAAGCTATTCTGTGGTGGAGGAAACTGAGGGCAGCTCCTTTGTCAC 
CAATTTAGCAAAGGACCTGGGTCTGGAGCAGAGGGAATTCTCCAGGCGGGGGGTTAGGGTTG 
TTTCCAGAGGGAACAAACTACATTTGCAGCTCAATCAGGAGACCGCGGATTTGTTGCTAAAT 
GAG AAA T T GGAC C G T G AGGAT C TGTGCGGT C AC ACAGAGC C C T G T GT GC TAC GT T T C CAAG T 
GT T G C T AGAGAG T C C CT T C GAGT T T TTT CAAGC TGAGCT GCAAGTAATAGACATAAACGACC 
AC T C T C C AGT AT T T C T GG AC AAACAAAT G T T GG T GAAAG TAT CAGAGAGCAGT CCTCCTGGG 
AC TAC GTTTCCTCT GAAGAAT GC C GAAGAC T TAGATGTAGGC CAAAACAATAT T GAGAACTA 
TATAATCAGCCCCAACTCCTATTTTCGGGTCCTCACCCGCAAACGCAGTGATGGCAGGAAAT 
ACCCAGAGCTGGTGCTGGACAAAGCGCTGGACCGAGAGGAAGAAGCTGAGCTCAGGTTAACA 
CTCACAGCACTGGATGGTGGCTCTCCGCCCAGATCTGGCACTGCTCAGGTCTACATCGAAGT 
CCTGGATGTCAACGATAATGCCCCTGAATTTGAGCAGCCTTTCTATAGAGTGCAGATCTCTG 
AGGACAGTCCGGTAGGCTTCCTGGTTGTGAAGGTCTCTGCCACGGATGTAGACACAGGAGTC 
AAC G GAG AG AT T T C C TAT T C AC TTT T CCAAGCT T CAGAAGAGAT T GGCAAAACC T T TAAGAT 
CAAT C C C T T GAC AGG AGAAAT T GAACTAAAAAAAC AAC T C GAT T T C GAAAAAC T T CAG T C CT 
AT G AAG T C AAT AT T GAG G C AAG AGAT G C T GG AACCT TT T C T GGAAAAT G CACCGT T CT GAT T 
CAAG T G AT AGAT G T GAAC GAC CAT G C CC C AGAAGTT AC C AT GT CT G CAT T TAC C AGC C CAAT 
ACCTGAGAACGCGCCTGAAACTGTGGTTGCACTTTTCAGTGTTTCAGATCTTGATTCAGGAG 
AAAATGGGAAAATTAGTTGCTCCATTCAGGAGGATCTACCCTTCCTCCTGAAATCCGCGGAA 
AAC TTT T ACAC C C TAC T AAC GGAGAGAC CACTAGACAGAGAAAGC AGAG C GGAATACAACAT 
CACTATCACTGTCACTGACTTGGGGACCCCTATGCTGATAACACAGCTCAATATGACCGTGC 
TGATCGCCGATGTCAATGACAACGCTCCCGCCTTCACCCAAACCTCCTACACCCTGTTCGTC 
CGCGAGAACAACAGCCCCGCCCTGCACATCCGCAGCGTCAGCGCTACAGACAGAGACTCAGG 
CACCAACGCCCAGGTCACCTACTCGCTGCTGCCGCCCCAGGACCCGCACCTGCCCCTCACAT 
CCCTGGTCTCCATCAACGCGGACAACGGCCACCTGTTCGCCCTCAGGTCTCTGGACTACGAG 
GCCCTGCAGGGGTTCCAGTTCCGCGTGGGCGCTTCAGACCACGGCTCCCCGGCGCTGAGCAG 
CGAGGCGCTGGTGCGCGTGGTGGTGCTGGACGCCAACGACAACTCGCCCTTCGTGCTGTACC 
CGCTGCAGAACGGCTCCGCGCCCTGCACCGAGCTGGTGCCCCGGGCGGCCGAGCCGGGCTAC 
CTGGT GAC CAAGGTGGTGGCGGT GGAC GGCGACTCGGGCCAGAACGCCTGGCTGTCGTACCA 
GCTGCTCAAGGCCACGGAGCTCGGTCTGTTCGGCGTGTGGGCGCACAATGGCGAGGTGCGCA 
CCGCCAGGCTGCTGAGCGAGCGCGACGCGGCCAAGCACAGGCTGGTGGTGCTGGTCAAGGAC 
AATGGCGAGCCTCCGCGCTCGGCCACCGCCACGCTGCACGTGCTCCTGGTGGACGGCTTCTC 
CCAGCCCTACCTGCCTCTCCCGGAGGCGGCCCCGACCCAGGCCCAGGCCGACTTGCTCACCG 
TCTACCTGGTGGTGGCGTTGGCCTCGGTGTCTTCGCTCTTCCTCTTTTCGGTGCTCCTGTTC 
GTGGCGGTGCGGCTGTGTAGGAGGAGCAGGGCGGCCTCGGTGGGTCGCTGCTTGGTGCCCGA 
GGGCCCCCTTC CAG G G CAT C T T GT GGAC AT GAGC GGCAC C AGGAC C C TAT C C CAGAGC T AC C 
AGTATGAGGTGTGTCTGGCAGGAGGCTCAGGGACCAATGAGTTCAAGTTCCTGAAGCCGATT 
ATCCCCAACTTCCCTCCCCAGTGCCCTGGGAAAGAAATACAAGGAAATTCTACCTTCCCCAA 
T AAC TTTGGGTT CAATAT T C AGTg&C CATAGT T GAC TT T T ACAT T C C AT AGGTAT T T TAT T T 
TGTGGCATTTCCATGCCAATGTTTATTTCCCCCAATTTGTGTGTATGTAATATTGTACGGAT 
TTACTCTTGATTTTTCTCATGTTCTTTCTCCCTTTGTTTTAAAGTGAACATTTACCTTTATT 
CCTGGTTCTT 
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FIGURE 163 



</usr/seqdb2/sst/DNA/I)naseqs .min/ss *DNA4831 4 
<subunit 1 of 1, 798 aa, 1 stop 
<MW: 87552, pi: 4.84, NX (S/T) : 5 

ME AS GKL I CRQRQVL FS FLLLGLS LAGAAE PRS YSWEETEGS S FVTNLAKDLGLE QREFSR 
RGVRWS RGNKLHLQLNQE TADLLLNEKLDRE DLCGH7E PCVLRFQVLLES P FE FFQAELQV 
IDINDHS PVFLDKQMLVKVSESSPPGTTFPLKNAEDLDVGQNNIENYI I SPNSYFRVLTRKR 
SDGRKYPELVLDKALDREEEAELRLTLTALDGGSPPRSGTAQVYIEVLDVNDNAPEFEQPFY 
RVQISEDSPVGFLVVKVSATDVDTGVNGEISYSLFQASEEIGKTFKINPLTGEIELKKQLDF 
EKLQSYEVNIEARDAGTFSGKCTVLIQVIDVNDHAPEVTMSAFT5PIPENAPETWALFSVS 
DLDSGENGKISCSIQEDLPFLLKSAENFYTLLTERPLDRESRAEYNITITVTDLGTPMLITQ 
LNMTVLIADVNDNAPAFTQTSYTLFVRENNSPALHIRSVSATDRDSGTNAQVTYSLLPPQDP 
HL P L T S LVS INADNGHL FALRS LDYEALQGFQFRVGASDHGS PALS SEALVRVWLDANDNS 

PFVLYPLQNGSAPCTELVPRAAEPGYLVTKWAVDGDSGQNAWLSYQLLKATELGLFGVWAH 

NGEVRTARLLSERDAAKHRLWLVKDNGEPPRSATATLHVXLVDGFSQPYLPLPEAAPTQAQ 

ADLLTVYLWA1LA.SVSSLFLFSVLLFVAVRLCRRSRAA.SVGRCLVPEGPLPGHLVDMSGTRT 
LS QS YQYEVC LAGGS GTNE FKFLKP IIPNFPPQC PGKE IQGNST FPNN FGFNIQ 

Important features: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 
amino acids 685-712 

Cadherins extracellular repeated domain signature. 

amino acids 122-132, 231-241, 336-346, 439-449 and 549-559 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 285-292 

N-glycosylation site. 

amino acids 418-421, 436-439, 567-570 and 786-789 
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FIGUR E 164 

ACCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCGCGTAGCCGTGC 

GCCGATTGCCTCTCGGCCTGGGCAMSGTCCCGGCTGCCGGTCGACGACCGCCCCGCGTCAT 

GCGGCTCCTCGGCTGGTGGCAAGTATTGCTGTGGGTGCTGGGACTTCCCGTCCGCGGCGTGG 

AGGTTGCAGAGGAAAGTGGTCGCTTATGGTCAGAGGAGCAGCCTGCTCACCCTCTCCAGGTG 

GGGGCTGTGTACCTGGGTGAGGAGGAGCTCCTGCATGACCCGATGGGCCAGGACAGGGCAGC 

AGAAGAGGCCAATGCGGTGCTGGGGCTGGACACCCAAGGCGATCACATGGTGATGCTGTCTG 

TGATTCCTGGGGAAGCTGAGGACAAAGTGAGTTCAGAGCCTAGCGGCGTCACCTGTGGTGCT 

GGAGGAGCGGAGGACTCAAGGTGCAACGTCCGAGAGAGCCTTTTCTCTCTGGATGGCGCTGG 

AGCACACTTCCCTGACAGAGAAGAGGAGTATTACACAGAGCCAGAAGTGGCGGAATCTGACG 

CAGCCCCGACAGAGGACTCCAATAACACTGAAAGTCTGAAATCCCCAAAGGTGAACTGTGAG 

GAGAGAAACATTACAGGATTAGAAAATTTCACTCTGAAAATTTTAAATATGTCACAGGACCT 

TATGGATTTTCTGAACCCAAACGGTAGTGACTGTACTCTAGTCCTGTTTTACACCCCGTGGT 

GCCGCTTTTCTGCCAGTTTGGCCCCTCACTTTAACTCTCTGCCCCGGGCATTTCCAGCTCTT 

CACTTTTTGGCACTGGATGCATCTCAGCACAGCAGCCTTTCTACCAGGTTTGGCACCGTAGC 

TGTTCCTAATATTTTATTATTTCAAGGAGCTAAACCAATGGCCAGATTTAATCATACAGATC 

GAACACTGGAAACACTGAAAATCTTCATTTTTAATCAGACAGGTATAGAAGCCAAGAAGAAT 

GTGGTGGTAACTCAAGCCGACCAAATAGGCCCTCTTCCCAGCACTTTGATAAAAAGTGTGGA 

CTGGTTGCTTGTATTTTCCTTATTCTTTTTAATTAGTTTTATTATGTATGCTACCATTCGAA 

CTGAGAGTATTCGGTGGCTAATTCCAGGACAAGAGCAGGAACATGTGGAGT^GTGATGGTCT 

GAAAGAAGTTGGAAAGAGGAACTTCAATCCTTCGTTTCAGAAATTAGTGCTACAGTTTCATA 

CATTTTCTCCAGTGACGTGTTGACTTGAAACTTCAGGCAGATTAAAAGAATCATTTGTTGAA 

CAACTGAATGTATAAAAAAATTATAAACTGGTGTTTTAACTAGTATTGCAATAAGCAAATGC 
AAAAATAT T C AAT AG 



J66 jt2>=t 
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FIGUR E 16 5 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss,DNA48333 
xsubunit 1 of 1, 360 aa, 1 stop 
XMW: 39885, pi: 4.79, NX(S/T): 7 
MVPAAGRRPPRVMRLLGWWQVLLWLGLPVRGVEVAEESGRLWSE 
ELLHDPMGQDRAAEEANAVLGLDTQGDHMVMLSVIPGEAEDKVSSEPSGV^ 

WRESLFSLDGAGAHFPDREEEYYTEPEVAESDAAPTEDSNNTESLKSPKVNCEERNITGLE 
NFTLKILNMSQDLMDFLNPNGSDCTLV^ 

QHSSLSTRFGTVAVPNILLFQGAKPMARFNHTDRTLETLKIFI FNQTGIEAKKNVVVTQADQ 
IGPLPSTLIKSVDWLLVFSLFFLISFIMYATIRTESIRWLIPGQEQEHVE 

Important features: 
Signal peptide: 

amino acids 1-25 

Transmembrane domain: 

amino acids 321-340 

Homologous region to dilsufide isomerase 

amino acids 212-302 

N-glycosylation site. 

amino acids 165-168, 181-184, 187-190, 194-197, 206-209, 278-281 
and 293-296 

Thioredoxin domain 

amino acids 211-227 



WO 99/46281 



PCT/US99/05028 



FIGU RE 1 6 6 



CCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCACGaaSCTGCAGGGCCCTGGCT 
CGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGGCTCGGCGCGCGGGCTCTTCCTC 
TTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATCCCGGTCAACCTGCA 
GCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCACGAGACCA 
TGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCG 
GACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGAC 
CATCCAGCCATGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCG 
CCTTCGGCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACAACGACCTT 
TGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATG 
T G AAG C C T GC AAAAAT AAAAAT GAT GAT GAG AAC G AC ATAAT GGAAACGCT T T G T AAAAATG 
AT T T T GC AC T G AAAAT AAAAGT GAAGGAGATAAC C TACAT CAAC CGAGATAC CAAAAT CAT C 
CTG GAGAC CAAGAGCAAGAC CATTTACAAGC T GAACGGT GTGTCCGAAAGGGACC TGAAGAA 
AT CGGTGCTGTGGCT C AAAG ACAGC TT GC AG T GCAC C T GT GAGGAGATGAAC GAC AT CAACG 
CGCCCTATCTGGTCATGGGACAGAAACAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGG 

TGGCAGAAGGGGCAGAGAGAGTTCAAG'CGCATCTCCCGCAGCATCCGCAAGCTGCAGTGC3a 
STCCCGGCATCCTGATGGCTCCGACAGGCCTGCTCCAGAGCACGGCTGACCATTTCTGCTCC 
GGGATCTCAGCTCCCGTTCCCCAAGCACACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCT 
TCCCCCTGCCTTTTGCACGTTTGCATCCCCAGCATTTCCTGAGTTATAAGGCCACAGGAGTG 
GAT AG C T GT T T T C AC C TAAAGGAAAAGC C CAC C C GAAT C T TG T AGAAATAT T CAAAC TAATA 
AAATCATGAATATTTTAA 




WO 99/46281 PCT/US99/05028 

FIGURE 167 

></usr/seqdb2/sst/DNA/Dnaseqs,min/ss.DNA50920 
xsubunit 1 of 1, 295 aa, 1 stop 
><MW: 33518/ pi; 7.74, NX(S/T): 0 

MLQGPGSLLLLFLASHCCLGSARGLFLFGQPDFSYKRSNCKPIPWLQLCHGIEYQNMRLFN 
LLGHE TMKE VLE QAGAW I P LVMKQCHPDTKKFLCS LFAPVCLDDLDE T I QPCHSLCV QVKDR 
CAPVMSAFGFPWPDMLECDRFPQDNDLCIPIASSra^ 

ETLCKNDFALKIKVKEITYINRDTKIILETKSKTIYKLNGVSERDLKKSVLWLKDSLQCTCE 
EMND I NAP YLVMGQKQGGE LVITS VKRWQKGQRE FKR ISRSI RKLQC 

Important features: 
Signal peptide: 

amino acids 1-20 

Cysteine rich domain, homolgous to frizzled N terminus 

amino acids 6-153 
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FIGURE 168 



GTGGAGGCCGCCGACG&ISGCGGGGCCGACGGAGGCCGAGACGGGGTTGGCCGAGCCCCGGG 

CCCTGTGCGCGCAGCGGGGCCACCGCACCTACGCGCGCCGCTGGGTGTTCCTGCTCGCGATC 
AGCCTGCTCAACTGCTCCAACGCCACGCTGTGGCTCAGCTTTGCACCTGTGGCTGACGTCAT 
TGCTGAGGACTTGGTCCTGTCCATGGAGCAGATCAACTGGCTGTCACTGGTCTACCTCGTGG 
TATCCACCCCATTTGGCGTGGCGGCCATCTGGATCCTGGACTCCGTCGGGCTCCGTGCGGCG 
ACCATCCTGGGTGCGTGGCTGAACTTTGCCGGGAGTGTGCTACGCATGGTGCCCTGCATGGT 

TGTTGGGACCCAAAACCCATTTGCCTTCCTCATGGGTGGCCAGAGCCTCTGTGCCCTTGCCC 
AGAGCCTGGTCATCTTCTCTCCAGCCAAGCTGGCTGCCTTGTGGTTCCCAGAGCACCAGCGA 
GCCACGGCCAACATGCTCGCCACCATGTCGAACCCTCTGGGCGTCCTTGTGGCCAATGTGCT 
GTCCCCTGTGCTGGTCAAGAAGGGTGAGGACATTCCGTTAATGCTCGGTGTCTATACCATCC 
CTGCTGGCGTCGTCTGCCTGCTGTCCACCATCTGCCTGTGGGAGAGTGTGCCCCCCACCCCG 
CCCTCTGCCGGGGCTGCCAGCTCCACCTCAGAGAAGTTCCTGGATGGGCTCAAGCTGCAGCT 
CATGTGGAACAAGGCCTATGTCATCCTGGCTGTGTGCTTGGGGGGAATGATCGGGATCTCTG 
CCAGCTTCTCAGCCCTCCTGGAGCAGATCCTCTGTGCAAGCGGCCACTCCAGTGGGTTTTCC 
GGCCTCTGTGGCGCTCTCTTCATCACGTTTGGGATCCTGGGGGCACTGGCTCTCGGCCCCTA 
TGTGGACCGGACCAAGCACTTCACTGAGGCCACCAAGATTGGCCTGTGCCTGTTCTCTCTGG 
CCTGCGTGCCCTTTGCCCTGGTGTCCCAGCTGCAGGGACAGACCCTTGCCCTGGCTGCCACC 
TGCTCGCTGCTCGGGCTGTTTGGCTTCTCGGTGGGCCCCGTGGCCATGGAGTTGGCGGTCGA 
GTGTTCCTTCCCCGTGGGGGAGGGGGCTGCCACAGGCATGATCTTTGTGCTGGGGCAGGCCG 
AGGGAATACTCATCATGCTGGCAATGACGGCACTGACTGTGCGACGCTCGGAGCCGTCCTTG 

TCCACCTGCCAGCAGGGGGAGGATCCACTTGACTGGACAGTGTCTCTGCTGCTGATGGCCGG 
CCTGTGCACCTTCTTCAGCTGCATCCTGGCGGTCTTCTTCCACACCCCATACCGGCGCCTGC 
AGGCCGAGTCTGGGGAGCCCCCCTCCACCCGTAACGCCGTGGGCGGCGCAGACTCAGGGCCG 
GGTGTGGACCGAGGGGGAGCAGGAAGGGCTGGGGTCCTGGGGCCCAGCACGGCGACTCCGGA 
GT GC AC G G C GAG GGGGGCCTCGC TAGAG G AC C C G AGAGG GC C C GGGAGCC C C C AC C CAGC CT 
GCCACCGAGCGACTCCCCGTGCGCAAGGCCCAGCAGCCACCGACGCGCCCTCCCGCCCCGGC 
AGACTCGCAGGCAGGGTCCAAGCGTCCAGGTTTATTGACCCGGCTGGGTCTCACTCCTCCTT 
CTCCTCCCCGTGGGTGATCACG2^£CTGAGCGCCTTGTAGTCCAGGTTGCCCGCCACATCGA 
TGGAGGCGAACTGGAACATCTGGTCCACCTGCGGGCGGGGGCGAAAGGGCTCCTTGCGGGCT 
CCGGGAGCGAATTACAAGCGCGCACCTGAAAA 




WO 99/46281 



PCT/US99/05028 



FIGURE 169 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss • DNA50988 
Xsubunit 1 of 1, 560 aa, 1 stop 
XMW: 53427, pi: 6,86, NX(S/T): 2 

MAGP TE AET G LAE PRALCAQRGHRT YARRWVFLLAI SLLNC SNATLWLS FAPVADVIAEDLV 
LSME Q I NWL S LVYLWS T ? FGVAAI W I LDS VGLRAAT I LGAWLNFAGSVLRMVPCMVVGTQN 
PFAFLMGGQSLCALAQSLVIFSPAKLAAL^^ 

KKGEDIPLMLGVYTIPAGWCLLSTICLWESVPPTPPSAGAA5STSEKFLDGLKLQLMWNKA 
YVILAVCLGGMIGISASFSALLEQILCASGKSSGFSGLCGALFITFGILGALALGPYVDRTK 
HFTEATKIGLCLFSLACVPFALVSQLQGQ 
GEGAATGMIFVLGQAEGILIMIAMTALT^ 

SCILAVFFHTPYRRLQAESGEPPSTRNAVGGADSGPGVDRGGAGRAGVLGPSTATPECTARG 
ASLEDPRGPGSPHPACHRATPRAQGPAATDAPSRPGRLAGRVQASRFIDPAGSHSSFSSPWVIT 

Important features: 

Potential Transmembrane domains : 

amino acids 30-50, 61-79, 98-112, 126-146, 169-182, 201-215, 248- 
268, 280-300, 318-337, 341-357, 375-387, 420-441 

N-glycosylation site. 

amino acids 4 0-43 and 4 3-4 6 

Glycosaminoglycan attachment site, 
amino acids 4 68-471 
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FIGURE 17 QA 



GTCCCACATCCTGCTCAACTGGGTCAGGTCCCTCTTAGACCAGCTCTTGTCCATCATTTGCT 
GAAGTGGACCAACTAGTTCCCCAGTAGGGGGTCTCCCCTGGCAATTCTTGATCGGCGTTTGG 
ACATCTCAGATCGCTTCCAATGAAGATGGCCTTGCCTTGGGGTCCTGCTTGTTTCATAATCA 
T C T AAC TAT GGGAC AAGG T T GT GCC GGCAGC T CT GGGGGAAGGAGCACGGGGCT GAT CAAGC 
CAT C C AGGAAACAC T GG AGG AC T T G T C C AGC C T T GAAAGAAC T CTAGTGGT T T C T GAAT C TA 
GCCCACTTGGCGGTAAGCMS3ATGCAACTTCTGCAACTTCTGCTGGGGCTTTTGGGGCCAGG 
T GGC T AC T TAT T TC TT T TAGGGGAT T GT C AGGAGGT GAC C AC T CTCACGGT GAAATAC CAAG 

TGTCAGAGGAAGTGCCATCTGGTACAGTGATCGGGAAGCTGTCCCAGGAACTGGGCCGGGAG 
GAGAGGCGGAGGCAAGCTGGGGCCGCCTTCCAGGTGTTGCAGCTGCCTCAGGCGCTCCCCAT 
TCAGGTGGACTCTGAGGAAGGCTTGCTCAGCACAGGCAGGCGGCTGGATCGAGAGCAGCTGT 
GCCGACAGTGGGATCCCTGCCTGGTTTCCTTTGATGTGCTTGCCACAGGGGATTTGGCTCTG 
ATCCATGTGGAGATCCAAGTGCTGGACATCAATGACCACCAGCCACGGTTTCCCAAAGGCGA 
GCAGGAGCTGGAAATCTCTGAGAGCGCCTCTCTGCGAACCCGGATCCCCCTGGACAGAGCTC 
T TGAC CCAGACACAGGC C C TAACAC C CT GCACAC CTACACTCTGTCTCC CAGTGAGCACTTT 
G C C T T GGAT GT CAT TGTGGGCCCT GAT GAGAC CAAACAT GC AGAACTCATAGT GGT GAAGGA 
GCTGGACAGGGAAATCCATTCATTTTTTGATCTGGTGTTAACTGCCTATGACAATGGGAACC 
C C C C CAAG T C AG G T AC C AG C T T GG T CAAGGT C AAC GT C T T GGACTC CAAT GACAATAGC C C T 
GCGTTTGCTGAGAGTTCACTGGCACTGGAAA.TCCAAGAAGATGCTGCACCTGGTACGCTTCT 
C AT AAAAC T GAC C G C C ACAGAC C C T GAC CAAGGC C C CAAT GGGGAGGTGGAG TT C T T C C T CA 
GTAAGCACATGCCTCCAGAGGTGCTGGACACCTTCAGTATTGATGCCAAGACAGGCCAGGTC 
AT TCTGCGTC GAC C T C T AGAC TAT GAAAAGAAC C C T GC C TAC GAGGTGGAT G T T C AGGCAAG 
GGACCTGGGTCCCAATCCTATCCCAGCCCATTGCAAAGTTCTCATCAAGGTTCTGGATGTCA 
ATGACAACATCCCAAGCATCCACGTCACATGGGCCTCCCAGCCATCACTGGTGTCAGAAGCT 
C T T C C CAAG G AC AGTTTT AT TGCTCTTGT CAT GGCAGAT GACTTGGAT T CAGGACAC AATGG 
T T T G G T C CAC T GC T GGC T GAGC CAAGAGC T GGGCCACT T CAGGCTGAAAAGAAC TAAT GGCA 
AC AC AT AC AT G T T G C T AAC CAAT G C C AC AC T GGACAGAGAG C AGT GGC C CAAATATAC C CT C 
AC T C T G T TAG C C CAAGACCAAGGAC T C CAGCC C TTAT GAG C CAAGAAACAGC TCAGC ATT CA 
GATCAGTGACATCAACGACAATGCACCTGTGTTTGAGAAAAGCAGGTATGAAGTCTCCACGC 
G GG AAAAC AAC T TAC C C T C T C T TC AC C T CAT TAC CAT CAAGGC T CAT GAT GCAGAC T T GGGC 
AT TAAT G GAAAAGT C T CAT AC C GC AT C C AGGAC T CC C CAGT T GCT CACT TAG T AGC TAT T GA 
CTCCAACACAGGAGAGGTCACTGCTCAGAGGTCACTGAACTATGAAGAGATGGCCGGCTTTG 
AGTTCCAGGTGATCGCAGAGGACAGCGGGCAACCCATGCTTGCATCCAGTGTCTCTGTGTGG 
GTCAGCCTCTTGGATGCCAATGATAATGCCCCAGAGGTGGTCCAGCCTGTGCTCAGCGATGG 
AAAAGCCAGCCTCTCCGTGCTTGTGAATGCCTCCACAGGCCACCTGCTGGTGCCCATCGAGA 
CTCCCAATGGCTTGGGCCCAGCGGGCACTGACACACCTCCACTGGCCACTCACAGCTCCCGG 
CCATTCCTTTTGACAACCATTGTGGCAAGAGATGCAGACTCGGGGGCAAATGGAGAGCCCCT 
C TAC AGC AT C C G CAAT GGAAAT GAAGC C CAC C T C T T CAT C C T CAACC CT CAT AC GGGGCAGC 
TGTTCGTCAATGTCACCAATGCCAGCAGCCTCATTGGGAGTGAGTGGGAGCTGGAGATAGTA 
G T AGAGG AC C AGGGAAGC C C C C C C T TACAGAC C C GAGC C C T GT TGAGGG TCAT G T T T GT CAC 
CAGTGTGGACCACCTGAGGGACTCAGCCCGCAAGCCTGGGGCCTTGAGCATGTCGATGCTGA 
CGGTGATCTGCCTGGCTGTACTGTTGGGCATCTTCGGGTTGATCCTGGCTTTGTTCATGTCC 
ATCTGCCGGACAGAAAAGAAGGACAACAGGGCCTACAACTGTCGGGAGGCCGAGTCCACCTA 
C C GC C AGC AG C C CAAGAG G C C C CAGAAACAC AT T CAGAAGGCAGAC AT C CAC CTCGTGCCTG 
T GC T C AG G GG T C AGG C AG G T GAG C C T T G T GAAG T CGGGC AGTC CCACAAAGAT GT G GACAAG 
GAGGCGATGATGGAAGCAGGCTGGGACCCCTGCCTGCAGGCCCCCTTCCACCTCACCCCGAC 
CCTGTACAGGACGCTGCGTAATCAAGGCAACCAGGGAGCACCGGCGGAGAGCCGAGAGGTGC 
T GC AAGAC AC GGT C AAC CTCCTTTT C AAC CAT C C CAGGCAGAGGAAT GCCTCCC GG GAGAAC 
C T GAAC C T T C C C GAGC C C C AGC C T G C CAC AGG C CAGCCAC GTTC CAGGC CT C TGAAGGT T GC 
AGGCAGC C C CAC AGGGAGGC TGGCT GGAGACCAGGGCAGTGAGGAAGCCCCACAGAGGCCAC 
CAGCCTCCTCTGCAACCCTGAGACGGCAGCGACATCTCAATGGCAAAGTGTCCCCTGAGAAA 
GAATCAGGGCCCCGTCAGATCCTGCGGAGCCTGGTCCGGCTGTCTGTGGCTGCCTTCGCCGA 
GCGGAACCCCGTGGAGGAGCTCACTGTGGATTCTCCTCCTGTTCAGCAAATCTCCCAGCTGC 
TGTCCTTGCTG CAT C AGG G C CAATT C C AG C C CAAAC CAAAC CAC C GAGGAAATAAGTACTT G 
GCCAAGCCAGGAGGCAGCAGGAGTGCAATCCCAGACACAGATGGCCCAAGTGCAAGGGCTGG 
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FIGURE 170B 

AGGC C AGACAGAC C C AG AAC AGGAGGAAGGGC C TTTGGAT C C T GAAGAGGAC CT C T CT GT GA 
AGCAACTGCTAGAAGAAGAGCTGTCAAGTCTGCTGGACCCCAGCACAGGTCTGGCCCTGGAC 
CGGCTGAGCGCCCCTGACCCGGCCTGGATGGCGAGACTCTCTTTGCCCCTCACCACCAACTA 
CCGTGACAATGTGATCTCCCCGGATGCTGCAGCCACGGAGGAGCCGAGGACCTTCCAGACGT 
TCGGCAAGGCAGAGGCACCAGAGCTGAGCCCAACAGGCACGAGGCTGGCCAGCACCTTTGTC 
TCGGAGATGAGCTCACTGCTGGAGATGCTGCTGGAACAGCGCTCCAGCATGCCCGTGGAGGC 
CGCCTCCGAGGCGCTGCGGCGGCTCTCGGTCTGCGGGAGGACCCTCAGTTTAGACTTGGCCA 
C CAG T G C AGC C T C AG G CAT GAAAGT GCAAGGGGAC C CAGGT GGAAAGAC GGGGAC T GAGGGC 

AAGAGCAGAGGCAGCAGCAGCAGCAGCAGGTGCCTGTS&ACATACCTCAGACGCCTCTGGAT 
CCAAGAACCAGGGGCCTGAGGATCTGTGGACAAGAGCTGGTTTCTAAAATCTTGTAACTCAC 
T AG CT AG CGG C GG C CT G AG AACTT TAGGGT G AC T G AT GCTAC CCCCACAGAGGAGGCAAGAG 
CCCCAGGACTAACAGCTGACTGACCAAAGCAGCCCCTTGTAAGCAGCTCTGAGTCTTTTGGA 
GGACAGGGACGGTTTGTGGCTGAGATAAGTGTTTCCTGGCAAAACATATGTGGAGCACAAAG 
GGT C AG T C C T C T G GC AGAAC AGAT GC CAC GGAG TAT CACAGG C AGGAAAGGG TGGCCTTCTT 
GGGTAGCAGGAGTCAGGGGGCTGTACCCTGGGGGTGCCAGGAAATGCTCTCTGACCTATCAA 
TAAAGGAAAAGCAGTAAAAAAAAAA?\AAAPAAAAA 
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FIGURE 171 



</u5r/seqdb2/sst/DNA/Dnaseqs,min/ss.DNA48331 
<subunit 1 of 1, 1184 aa, 1 stop 
<MW: 129022, pi: 5.20, NX(S/T): 5 

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQA 
GAAFQ VL QL P QAL P I QVD S EEGLLS TGRRLDREQLCRQWDPCLVS FDVLATGDLAL I HVE I Q 

VLDINDHQPRFPKGEQELEISESASLRTRIPLDRALDPDTGPNTLHTYTLSPSEHFALDVIV 
GPDETKHAEL I WKELDRE IHS FFDLVLTAYDNGNPPKSGTSLVKVKVLDSNDNS PAFAESS 

LALEIQEDAAPGTLLIKLTATDPDQGPNGEVEFFLSKHMPPEVLDTF5IDAKTGQVILRRPL 

DYEKNPAYEVDVQARDLGPNPIPAHCKVLIKVLDVNDNIPSIHVTWASQPSLVSEALPKDSF 

IALVMADDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLAQD 

QGLQPLSAKKQLSIQISDINDNAPVFEKSRYEVSTRENNLPSLHLITIKAHDADLGINGKVS 

YRIQDSPVAHLVAIDSNTGEVTAQRSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDA 

NDNAPEWQPVLSDGKASLSVLVNASTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTT 

IVARDADSGANGEPLYSIRNGNEAHLFILNPHTGQLFVNVTNASSLIGSEWELEIVVEDQGS 

PPLQTRALLRVMFVTSVDHLRDSARKPGALSMSMLTVICLAVLLGIFGLILALFMSICRTEK 
KDNRAYNCREAESTYRQQPKRPQKHIQKADIHLVP^ 

GWDPCLQAPFHLTPTLYRTLRNQGNQGAPAESREVLQDTVNLLFNHPRQRNASRENLNLPEP 
QPATGQPRSRPLKVAGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQ 
ILRSLVRLSVAAFAERNPVEELTVDSPPVQQISQLLSLLHQGQFQPKPNHRGNKYLAKPGGS 
RS AI P DT DG P S ARAGGQT D PEQEEGPLDPEEDLSVKQLLEEELS S LLDP S T GLALDRLSAPD 
P AWMARL S L P L T TNYRDNVI S P DAAATEE PRTFQT FGKAE APE LS P TGTRLAS T FVS EMS S L 

LEMLLEQRSSMPVEAASEALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSS 
SSSRCL 

Important features: 
Signal peptide: 

amino acids 1-13 

Transmembrane domain: 

amino acids 719-739 

N^-glycosylation site. 

amino acids 415-418, 582-585, 659-662, 662-665 amd 857-860 

Cadherins extracellular repeated domain signature. 

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563 
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FIGUR E 172 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGG 

CAGACCGTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAG 

GCAGGAGCCTTCCTTACACTTCGCCAISAGTTTCCTCATCGACTCCAGCATCATGATTACCT 

CCCAGATACTATTTTTTGGATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTAT 

GAGATACGTCAGTATGTTGTACAGGTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCAT 

GTTTGAGCTCATCATCT7TGAAATCTTAGGAGTATTGAATAGCAGCTCCCGTTATTTTCACT 

GGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTTTCATGGTGCCTTTTTACATTGGC 

TATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTT7TTTCCTGTCTCTT 

ATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCCCATTCTCAGCCCAA 

AACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAGTGACTCTC 

ATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTCCT 
C AGG AAT G T GAC T G ACAC G GATAT T C T AGC C C T GGAAC GGC GAC TGC TGCAAAC CAT GGATA 

TGATCATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAA 
GT GC AT AAC AAAC C A T C AG G T T T C T GGGGAAT GAT AAAAAG T G T TAC CAC T T CAGCAT C AGG 

AAGTGAAAATCTTACTCTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGC 
TT T T T C T GG AAAC AG C T GAT CT ATAT GC TAC CAAGGAGAGAATAGAATAC T C CAAAACC T T C 

AAGGGGAAATATTTTAATTTTCTTGGTTACTTTTTCTCTATTTACTGTGTTTGGAAAATTTT 
CAT GGC TAC CAT C AAT AT T G TT TT T GAT C G AGTT G GGAAAAC GGAT C C T GT CACAAGAGGCA 
T T G AGAT CAC T G T G AAT TAT CT GGGAAT C C AATTT GAT GT GAAG TTTTGGTCC CAAC ACAT T 
TCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGATTGCTGATCACTCTTAC 
CAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTGCTATTAGCAC 
AGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTTAGAA 
TACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTT 
TGATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAAC 
AGGC AC C AG AGAAG C AAAT G GCAC C T T£zAAC T TAAGC C TAC T ACAGAC TGT TAGAGG C C AGT 
GG T T T CAAAAT T T AG AT AT AAGAGGGGGGAAAAAT GGAAC CAGGGC C TGAC ATT T TAT AAAC 
AAAC AAAAT G C TAT G G TAG C AT TT T T CAC C T T C ATAGCAT AC TCCTTCCC C GT C AGG T GATA 
CTATGACCATGAGTAGCATCAGCCAGAACATGAGAGGGAGAACTAACTCAAGACAATACTCA 
GCAGAGAGCATCCCGTGTGGATATGAGGCTGGTGTAGAGGCGGAGAGGAGCCAAGAAACTAA 
AGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGTCTATGGTAGCTGAGCCAAACACGT 
AGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTGCCTTGAGATTGACTCATT 
AAAAT C AGAGAC T G T AAC AAAAAAAAAAAAAAAAAAAAAGGGC GGC CGC GAC T C TAGAGT CG 
ACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 173 

MS FL I DS S IM I T S Q I L FFGFGWLFFMRQL FKD YE IRQYWQVI FS VT FAF3CTMFELIIFEI 
LGVLNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNI^LLHKQRLLFSCLLWLTFMYFF 
WKLGDPFPILSPKHGILSIEQLISRVGVIGV^ 
LALERRLLQTMDMIISKKKRMflMARRTM 

QEVDALEELSRQLFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVF 
DRVGKTDPVTRGIEITVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISS 
SKSSNVIVLLIAQIMGMYFVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSA 
LS S I L FL YLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 31-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, i80-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
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FIGU R E 17 4 

CATGGGAAGTGGAGCCGGAGCCTTCCTTACACTCGCCATGAGTTTCCTOITCGACTCCAGCA 
TCATGATTACCTCCCNGANACTATTTTTTGGATTTGGGTGGCTTTTCTTCNGCGCCAATGTT 
TAAAGACTATGAGATACGTCAGTATGTTGTACNGGTGATCTTCTCCGTGACGTTTGCCATTT 
C T T GC AC CAT G T ? T GAG C T CAT CATC TT T GAAAT C TTNGGAG T AT TGAATAGC AGC T C C C G T 
TATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTNTCATGGTGCCTTT 
T TACAT T GG C TAT T T TAT T G T GAGCAATAT C C GACTAC T GC ATAAACAACGACT GCT TTTT T 
CCTGTCTCTTATGGCTGACCTTTATGTATTTCCAG 
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GTG7TGCCCTTGGGGAGGGGAAGGGGAGCCNGGCCCTTTCCTAAAATTTGGCCAAGGGTTTC 
TTTNTTGAATTCCGGGTTNNGNATACCTTCCCAGAAAATATTTTTTGGATTTGGGGTAGNTT 
TT T T T CAT GC GC CAAT T GT T TAAAG ACT AT GAGAT AC GT C AGT ATGTTGT ACAGGT GATNT T 
NTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATNTTTGAP^ATNTTAGGAG 
TATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATC 
CTGGTTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCA 
TAAACAACGACTGCTTTTTTCCTGTCTNTTATGGCTGACCTTTATGTATTTNTTNTGGAAAN 
TAGG AGAT CCCTTTCC CAT TC T C 
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FIGUR E 176A 

CTCGCGCAGGGATCGTCCCAISGCCGGGGCTCGGAGCCGCGACCCTTGGGGGGCCTCCGGGA 
TTTGCTACCTTTTTGGCTCCCTGCTCGTCGAACTGCTCTTCTCACGGGCTGTCGCCTTCAAT 
CTGGACGTGATGGGTGCCTTGCGCAAGGAGGGCGAGCCAGGCAGCCTCTTCGGCTTCTCTGT 
GGCCCTGCACCGGCAGTTGCAGCCCCGACCCCAGAGCTGGCTGCTGGTGGGTGCTCCCCAGG 
CCCTGGCTCTTCCTGGGCAGCAGGCGAATCGCACTGGAGGCCTCTTCGCTTGCCCGTTGAGC 
C T GGAGGAGAC T GAC T G C T ACAGAG T G GACAT C GAC CAGG GAGC T GATAT GCAAAAGGAAAG 
CAAGGAGAAC C AGT GGTT GGGAGT CAGT GTT CGGAGCCAGGGGC CT GGGGGCAAGATTGTTA 
CCTGTGCACACCGATATGAGGCAAGGCAGCGAGTGGACCAGATCCTGGAGACGCGGGATATG 
ATTGGTCGCTGCTTTGTGCTCAGCCAGGACCTGGCCATCCGGGATGAGTTGGATGGTGGGGA 
ATGGAAGTTCTGTGAGGGACGCCCCCAAGGCCATGAACAATTTGGGTTCTGCCAGCAGGGCA 
CAGCTGCCGCCTTCTCCCCTGATAGCCACTACCTCCTCTTTGGGGCCCCAGGAACCTATAAT 
TGGAAGGGCACGGCCAGGGTGGAGCTCTGTGCACAGGGCTCAGCGGACCTGGCACACCTGGA 
CGACGGTCCCTACGAGGCGGGGGGAGAGAAGGAGCAGGACCCCCGCCTCATCCCGGTCCCTG 
CCAACAGCTACTTTGGCTTCTCTATTGACTCGGGGAAAGGTCTGGTGCGTGCAGAAGAGCTG 
AGCTTTGTGGCTGGAGCCCCCCGCGCCAACCACAAGGGTGCTGTGGTCATCCTGCGCAAGGA 
CAGCGCCAGTCGCCTGGTGCCCGAGGTTATGCTGTCTGGGGAGCGCCTGACCTCCGGCTTTG 
GC TAC T CAC TGGCTGTGGCT GAC C T CAACAGT GAT GGC T GG C CAGACC T GATAGT GGGTGC C 
CCCTACTTCTTTGAGCGCCAAGAAGAGCTGGGGGGTGCTGTGTATGTGTACTTGAACCAGGG 
GGGTCACTGGGCTGGGATCTCCCCTCTCCGGCTCTGCGGCTCCCCTGACTCCATGTTCGGGA 
TCAGCCTGGCTGTCCTGGGGGACCTCAACCAAGATGGCTTTCCAGATATTGCAGTGGGTGCC 
CCCTTTGATGGTGATGGGAAAGTCTTCATCTACCATGGGAGCAGCCTGGGGGTTGTCGCCAA 
ACCTTCACAGGTGCTGGAGGGCGAGGCTGTGGGCATCAAGAGCTTCGGCTACTCCCTGTCAG 
GCAGCTTGGATATGGATGGGAACCAATACCCTGACCTGCTGGTGGGCTCCCTGGCTGACACC 
GCAGTGCTCTTCAGGGCCAGACCCATCCTCCATGTCTCCCATGAGGTCTCTATTGCTCCACG 
AAGCATCGACCTGGAGCAGCCCAACTGTGCTGGCGGCCACTCGGTCTGTGTGGACCTAAGGG 
TCTGTTTCAGCTACATTGCAGTCCCCAGCAGCTATAGCCCTACTGTGGCCCTGGACTATGTG 
TTAGATGCGGACACAGACCGGAGGCTCCGGGGCCAGGTTCCCCGTGTGACGTTCCTGAGCCG 
TAACCTGGAAGAACCCAAGCACCAGGCCTCGGGCACCGTGTGGCTGAAGCACCAGCATGACC 
GAGT C T G T GGAGAC G C CAT GTT C C AGC T C C AGGAAAAT GT C AA&GACAAGC T T CGGGCCAT T 
GTAGTGACCTTGTCCTACAGTCTCCAGACCCCTCGGCTCCGGCGACAGGCTCCTGGCCAGGG 
GCTGCCTCCAGTGGCCCCCATCCTCAATGCCCACCAGCCCAGCACCCAGCGGGCAGAGATCC 
ACTTCCTGAAGCAAGGCTGTGGTGAAGACAAGATCTGCCAGAGCAATCTGCAGCTGGTCCAC 
GCCCGCTTCTGTACCCGGGTCAGCGACACGGAATTCCAACCTCTGCCCATGGATGTGGATGG 
AACAAC AG CCCTGTTT GCAC TGAGT G GGCAGC CAGT CATT G GC C T GGAG C T GAT G GT CAC C A 
ACCTGCCATCGGACCCAGCCCAGCCCCAGGCTGATGGGGATGATGCCCATGAAGCCCAGCTC 
CTGGTCATGCTTCCTGACTCACTGCACTACTCAGGGGTCCGGGCCCTGGACCCTGCGGAGAA 
GCCACTCTGCCTGTCCAATGAGAATGCCTCCCATGTTGAGTGTGAGCTGGGGAACCCCATGA 
AGAGAGGTGCCCAGGTCACCTTCTACCTCATCCTTAGCACCTCCGGGATCAGCATTGAGACC 
ACGGAACTGGAGGTAGAGCTGCTGTTGGCCACGATCAGTGAGCAGGAGCTGCATCCAGTCTC 
TGCACGAGCCCGTGTCTTCATTGAGCTGCCACTGTCCATTGCAGGAATGGCCATTCCCCAGC 
AACTCTTCTTCTCTGGTGTGGTGAGGGGCGAGAGAGCCATGCAGTCTGAGCGGGATGTGGGC 
AGCAAGGT CAAG TAT GAG G T CAC GG T TT C CAAC CAAGGCCAG T C GC T CAGAAC C C T GG GCT C 
TGCCTTCCTCAACATCATGTGGCCTCATGAGATTGCCAATGGGAAGTGGTTGCTGTACCCAA 
TGCAGGTTGAGCTGGAGGGCGGGCAGGGGCCTGGGCAGAAAGGGCTTTGCTCTCCCAGGCCC 
AACAT C C T C CAC C T G GAT GT GGAC AGT AGGGAT AGGAGGC GGC GGGAGCT GGAGC C AC CT GA 
GCAGCAGGAGCCTGGTGAGCGGCAGGAGCCCAGCATGTCCTGGTGGCCAGTGTCCTCTGCTG 
AGAAGAAGAAAAACATCACCCTGGACTGCGCCCGGGGCACGGCCAACTGTGTGGTGTTCAGC 
TGCCCACTCTACAGCTTTGACCGCGCGGCTGTGCTGCATGTCTGGGGCCGTCTCTGGAACAG 
CACCTTTCTGGAGGAGTACTCAGCTGTGAAGTCCCTGGAAGTGATTGTCCGGGCCAACATCA 
CAGTGAAGTCCTCCATAAAGAACTTGATGCTCCGAGATGCCTCCACAGTGATCCCAGTGATG 
GTATACTTGGACCCCATGGCTGTGGTGGCAGAAGGAGTGCCCTGGTGGGTCATCCTCCTGGC 
TGTACTGGCTGGGCTGCTGGTGCTAGCACTGCTGGTGCTGCTCCTGTGGAAGATGGGATTCT 
TCAAACGGGCGAAGCACCCCGAGGCCACCGTGCCCCAGTACCATGCGGTGAAGATTCCTCGG 
GAAGACCGACAGCAG T T CAAGGAGGAGAAGAC GGGCAC CAT C C T GAGGAACAAC T GGGGCAG 
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CCCCCGGCGGGAGGGCCCGGATGCACACCCCATCCTGGCTGCTGACGGGCATCCCGAGCTGG 
GCCCCGATGGGCATCCAGGGCCAGGCACCGCCl&SGTTCCCATGTCCCAGCCTGGCCTGTGG 
CTGCCCTCCATCCCTTCCCCAGAGATGGCTCCTTGGGATGAAGAGGGTAGAGTGGGCTGCTG 
GTGTCGCATCAAGATTTGGCAGGATCGGCTTCCTCAGGGGCACAGACCTCTCCCACCCACAA 
GAAC T C C T C C C AC C CAAC T T C C C C T TAGAGT GC T GT GAGAT GAGAG T GGGTAAATCAGGGAC 
AGGGCCATGGGGTAGGGTGAGAAGGGCAGGGGTGTCCTGATGCAAAGGTGGGGAGAAGGGAT 
CCTAATCCCTTCCTCTCCCATTCACCCTGTGTAACAGGACCCCAAGGACCTGCCTCCCCGGA 
AGTGCCTTAACCTAGAGGGTCGGGGAGGAGGTTGTGTCACTGACTCAGGCTGCTCCTTCTCT 
AGTTTCCCCTCTCATCTGACCTTAGTTTGCTGCCATCAGTCTAGTGGTTTCGTGGTTTCGTC 
TATTTATTAA?\AAATATTTGAGAACAAAAAAAAAAAAAAAAAAAA 




WO 99/46281 



PCT/US99/05028 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA55737 
xsubunit I of 1, 1141 aa, 1 stop 
XMW: 124671, pi: 5.82, NX(S/T): 5 

MAGARSRDPWGASGICYLFGSLLVELLFSRAVAFNLDVMGALRKEGEPGSLFGFSVALHRQL 

QPRPQSWLLVGAPQALALPGQQANRTGGLFACPLSLEETDCYR T vnDIDQGADMQKESKENQWL 

GVSVRSQGPGGKIVTCAHRYEARQRVDQILETRDMIGRCFVLSQDLAIRDELDGGEWKFCEG 

RPQGHEQFGFCQQGTiWlF5PDSHYLLFGAPGTYNWKGTARVELCAQGSAD3^AHLDDGPYEA 

GGEKEQDPRLIPVPANSYFGFSIDSGKGLVRAEELSFVAGAPRANHKGAWILRKDSASRLV 

PEVMLSGERLTSGFGYSLAVADLNSDGWPDLIVGAPYFFERQEELGGAVTVYLNQGGHWAGI 

SPLRLCGSPDSMFGISLAVLGDLNQDGFPDIAVGAPFDGDGKVFIYHGSSLGWAKPSQVLE 

GEAVGIKS FGYSLSGSLDMDGNQYPDLLVGSLADTAVLFRARPILHVSHEVS IAPRSIDLEQ 

PNCAGGHSVCVDLRVCFSYIAVPSSYSPTVALDYVLDADTDRRLRGQVPRVTFLSRNLEEPK 

HQASGTVWLKHQHDRVCGDAMFQLQENVKDKLRAIWTLSYSLQTPRLRRQAPGQGLPPVAP 

ILNAHQPSTQRAEIHFLKQGCGEDKICQSNLQLVHARFCTRVSDTEFQPLPMDVDGTTALFA 

LSGQPVIGLELMVTNLPSDPAQPQADGDDAHEAQLLVMLPDSLHYSGVRALDPAEKPLCLSN 

ENASHVECELGNPMKRGAQVTFYLIL.STSGISIETTELEVELLLATISEQELHPVSARARVF 

IELPLSIAGMAIPQQLFFSGVWGERAMQSERDVGSKVKYEVTVSNQGQSLRTLGSAFLNIM 

WPHEIANGKWLLYPMQVELEGGQGPGQKGLCSPRPNILHLDVDSRDRRRRELEPPEQQEPGE 

RQEPSMSWWPVSSAEKKKNITLDCARGTANCWFSCPLYSFDRAAVLHWGRLWNSTFLEEY 

SAVKSLEVIVRANITVKSS I KNLMLRDAS TVI PVMVYLDPMAWAEGVPWWVILLAVLAGLL 

VLALLVLLLWKMGFFKRAKHPEATVPQ 

DAHPILAADGHPELGPDGHPGPGTA 



Important features : 
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

amino acids 1039-1064 



N-glycosylation sites. 

amino acids 86-89, 746-749, 949-952, 985-988 and 1005-1008 



Integrins alpha chain proteins . 

amino acids 1064-1071, 384-408, 1041-1071, 317-346, 443-465, 385- 
407, 215-224, 634-647, 85-99, 322-346, 470-479, 442-466, 379-408 
and 1031-1047 
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CGCGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGA 
AAGCAGCGAGTTGGCAGAGCAGGGCTGCATTTCCAGCAGGAGCTGCGAGCACAGTGCTGGCT 
CACAACAAG^XSGTCAAGGTGTCAGCCGTACTGTGTGTGTGTGCAGCCGCTTGGTGCAGTCA 
GTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGGGCGGTCGGACGGCGGTAATTTTC 
T GGATGATAAACAAT GGC T CAC C ACAATC T C TCAGTAT GACAAGGAAGT C GGACAGT GGAAC 
AAA.TTCCGAGACGAAGTAGAGGATGATTATTTCCGCACTTGGAGTCCAGGAAAACCCTTCGA 
TCAGGCTTTAGATCCAGCTAAGGATCCATGCTTAAAGATGAAATGTAGTCGCCATAAAGTAT 
GCATTGCTCAAGATTCTCAGACTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATG 
AAAGAAGCAGGAGTAGACCATAGGCAGTGGAGGGGTCCCATATTATCCACCTGCAAGCAGTG 
CCCAGTGGTCTATCCCAGCCCTGTTTGTGGTTCAGATGGTCATACCTACTCTTTTCAGTGCA 
AACTAGAATATCAGGCATGTGTCTTAGGAAAACAGATCTCAGTCAAA.TGTGAAGGACATTGC 
C CAT G T C C T T C AG AT AAGC C CAC CAGTACAAGCAGAAAT G T TAAGAGAGCAT GC AGT GAG C T 
GGAGTTCAGGGAAGTGGCAAACAGATTGCGGGACTGGTTCAAGGCCCTTCATGAAAGTGGAA 
G T C AAAAC AAG AAG AC AAAAAC AT T GC T GAGG C C T GAGAGAAG C AGAT T C GAT AC C AGCAT C 
TTGCCAATTTGCAAGGACTCACTTGGCTGGATGTTTAACAGACTTGATACAAACTATGACCT 
GCTATTGGACCAGTCAGAGCTCAGAAGCATTTACCTTGATAAGAATGAACAGTGTACCAAGG 
CATTCTTCAATTCTTGTGACACATACAAGGACAGTTTAATATCTAATAATGAGTGGTGCTAC 
TGCTTCCAGAGACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCA 
AGGG G T AAAGAAG C T C C TAG GACAGTATATCC CCCT G T G T GATGAAGAT GGTTAC T ACAAGC 
C AAC AC AAT G T CAT GGC AG T GTT GGACAG TGC T GGT G T G T T GACAGATATGGAAAT GAAGTC 
AT GGG AT CC AGAAT AAAT G G T GT T GCAGAT T GT GC TAT AGAT T T T GAGAT CTCC GGAGAT T T 
TGC TAGTGGCGAT T T TCATGAATGGACTGATGATGAGGATGATGAAGACGATATTATGAATG 
ATGAAGATGAAATTGAAGATGATGATGAAGATGAAGGGGATGATGATGATGGTGGTGATGAC 
CAT GAT G TAT AC AT TT££&T T GATGACAGTTGAAATCAATAAATTCTACATTTCTAATATTTA 
C AAAAA T GAT AG C C TAT T T AAAAT TATCTTCTTCCC C AAT AAC AAAAT GAT T C T AAAC C T C A 
C AT ATATT T T G TAT AAT T A T T T GAAAAAT T GCAGC T AAAGT TAT AGAACTT TAT GT TTAAAT 
AAGAAT CAT T T G C T T T GAG T T TT TAT AT T C CT T ACAC AAAAAGAAAAT ACAT AT GC AGT C T A 
G T CAGACAAAAT AAAG T T T T GAAGT GCTAC TATAATAAAT T T T T CAC GAGAACAAAC TTT GT 
AAAT C T T C CAT AAGC AAAAT GACAGC TAGTGCT TGGGAT C G TAC AT G T TAATT T TT TGAAAG 
ATAATTCTAAGTGAAATTTAAAATAAATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGG 
AAAAATATGCATGCTTTAATTGCATTTCCAAAGTAGCATCTTGCTAGACCTAGATGAGTCAG 
GATAACAGAGAGATACCACATGACTCCAAAAAAAAAAAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA49829 
xsubunit 1 of 1, 436 aa, 1 stoo 
XMW: 49429, pi: 4.80, NX(S/T): 0 
MLKVSAVLCVCAAAWCSQSLAAAAAVA^ 
DEVEDDYFRTWSPGKPFDQALDPAKDPCLKMK^ 

GVDHRQWRGPILSTCKQCPVVYPSPVCGSDGKTYSFQCKLEYQACVLGKQISVKCEGHCPCP 
SDKPTSTSRNVKRACSDLEFREVA^ 

CKDSLGWMFNRLDTNYDLLLDQSELRSIYLDKNEQCTKAFFNSCDTYKDSLISNNEWCYCFQ 
RQQDPPCQTELSNIQKRQGVKKLLGQYIPLCDEDGYYKPTQCHGSVGQCWCVDRYGNEVMGS 
RINGVADCAIDFEISGDFASGDFHEWTDDEDDEDDIMraEDEIEDDDEDEGDDDDGGDDHDVYI 

Important features: 
Signal peptide: 

amino acids 1-16 

Leucine zipper pattern. 

amino acids 24 6-2 67 

N-myristoylation sites. 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeat proteins 

amino acids 353-365 and 339-352 
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CAGACTCCAGATTTCCCTGTCAACCACGAGGAGTCCAGAGAGGAAACGCGGAGCGGAGACAA 
C AG T AC C T GAC GCCTCTTT C AGC C C GGGAT C GCC C CAGCAGGGATgGGC GACAAGAT C T GGC 

TGCCCTTCCCCGTGCTCCTTCTGGCCGCTCTGCCTCCGGTGCTGCTGCCTGGGGCGGCCGGC 
TTCACACCTTCCCTCGATAGCGACTTCACCTTTACCCTTCCCGCCGGCCAGAAGGAGTGCTT 
CTACCAGCCCATGCCCCTGAAGGCCTCGCTGGAGATCGAGTACCAAGTTTTAGATGGAGCAG 
GATTAGATATTGATTTCCATCTTGCCTCTCCAGAAGGCAAAACCTTAGTTTTTGAACAAAGA 
AAATCAGATGGAGTTCACACTGTAGAGACTGAAGTTGGTGATTACATGTTCTGCTTTGACAA 

TACATTCAGCACCATTTCTGAGAAGGTGATTTTCTTTGAATTAATCCTGGATAATATGGGAG 
AAC AGGC AC AAGAAC AAGAAGAT T GGAAGAAATATAT TAG T GGC ACAGATATAT T GGATAT G 
AAACT GGAAGACAT C C T GGAAT CCATCAACAGCATCAAGT C CAGACTAAGCAAAAGTGGGCA 
C ATAC AAAT T C T G C T TAG AG CAT T TGAAGC T C G T GAT C GAAAC ATACAAGAAAGCAACT T T G 
AT AGAG T C AAT TTCTGGTC TAT GGTTAAT T TAGT GG T CAT GGTGGTGGTGT CAGC CAT T CAA 
GTTTATATGCTGAAGAGTCTGTTTGAAGATAAGAGGAAAAGTAGAACTS^AACTCCAAACT 
AGAGTACGTAACATTGAAAAATGAGGCATAAAAATGCAATAAACTGTTACAGTCAAGACCAT 
T AAT GGTCTTCTC C AAAAT AT T T T GAGAT ATAAAAGTAGGAAACAGGTATAAT T T TAAT GT G 

AAAATTAAGTCTTCACTTTCTGTGCAAGTAATCCTGCTGATCCAGTTGTACTTAAGTGTGTA 
ACAGGAAT AT T T T G C AG AAT ATAGG T T TAAC T GAAT GAAGC CATAT TAATAAC T GCAT T T T C 

CTAACTTTGAAAAATTTTGCAAATGTCTTAGGTGATTTAAATAAATGAGTATTGGGCCTAAT 
T G C AAC AC CAG TCTGTTTT T AACAGGTTC TAT TAC C CAGAAC T T TT T T G TAAA.T GC GGCAG T 
T AC AAAT TAAC T G T GGAAG T T T T CAGTT T TAAGT TAT AAAT C AC C T GAGAAT T AC CTAAT GA 

TGGATTGAATAAATCTTTAGACTACAAAAGCCCAACTTTTCTCTATTTACATATGCATCTCT 
C C TAT AAT G T AAAT AGAAT AAT AGCT T T GAAAT ACAAT T AGG T T T T T GAGAT T T T TATAAC C 
AAAT AC AT T T CAG T G TAAC ATAT TAGCAGAAAGCAT TAGT C T T T G T ACT TT GC TTACATTC C 
CAAAAGCTGACATTTTCACGATTCTTAAAAACACAAAGTTACACTTACTAAAATTAGGACAT 
GTTTTCTCTTTGAAATGAAGAATATAGTTTAAAAGCTTCCTCCTCCATAGGGACACATTTTC 
TCTAACCCTTAACTAAAGTGTAGGATTTTAAAATTAAATGTGAGGTAAAATAAGTTTATTTT 
TAATAGTATCTGTCAAGTTAATATCTGTCAACAGTTAATAATCATGTTATGTTAATTTTAAC 
ATGATTGCTGACTTGGATAATTCATTATTACCAGCAGTTATGAAGGAAATATTGCTAAAATG 
ATCTGGGCCTACCATAAATAAATATCTCCTTTTCTGAGCTCTAAGAATTATCAGAAAACAGG 
AAAGAAT T T AG AAAAAC T T GAG AAAACC TAAT CCAAAATAAAAT TCACT TAAGT AGAAC TAT 
AAATAAAT AT C TAG AAT C T GAC T GGC T CATCAT GACAT C C TAC T CAT AACATAAAT CAAAGG 

AGATGATTAATTTCCAGTTAGCTGGAAGAAACTTTGGCTGTAGGTTTTTATTTTCTACAAGA 
AT TCTGGTTT GAAT TAT T T T T GTAAGCAGGTACAT T T T ATAAAAT G T AAGC C C TAC T G TAAG 
G T T T AGCAC T GG G T G TAC AT AT T TAT TAAAAAT T TT T AT TATAAC AAC T TT TAT TAAAAT GG 

CCTTTCTGAACACTTTATTTATTGATGTTGAAGTAAGGATTAGAAACATAGACTCCCAAGTT 
T TAAAC AC C T AAAT G T GAAT AAC C CAT ATATACAACAAAG T T T C T GC CAT CTAGCT T T T T GA 

AGTCTATGGGGGTCTTACTCAAGTACTAGTAATTTAACTTCATCATGAATGAACTATAATTT 
TTAAGTTATGCCCATTTATAACGTTGTTTATGACTACATTGTGAGTTAGAAACAAACTTAAA 
AT T T GGGGT AT AGAAC C C C T CAACAGGT TAGTAAT GC T GGAAT T C T T GAT GAGCAATAATGA 

TAACCAGAGAGTGATTTCATTTACACTCATAGTAGTATAAAAAGAGATACATTTCCCTCTTA 
GGCCCCTGGGAGAAGAGCAGCTTAGATTTCCCTACTGGCAAGGTTTTTAAAAATGAGGTAAA 
TGCCGTATATGATCAATTACCTTAATTGGCCAAGAAAATGCTTCAGGTGTCTAGGGGTATCC 
TCTGCAACACTTGCAGAACAAAGGTCAATAAGATCCTTGCCTATGAATACCCCTCCCTTTTG 
CGCTGTTAAATTTGCAATGAGAAGCAAATTTACAGTACCATT^ACTAATAAAGCAGGGTACAG 
ATAT AAAC TAC T GCAT C T T T T C TATAAAAC TGTGATTAAGAAT T C TAC CTCTCCTG TAT GGC 

TGTTACTGTACTGTACTCTCTGACTCCTTACCTAACAATGAATTTGTTACATAATCTTCTAC 
ATGTATGATTTGTGCCACTGATCTTAAACCTATGATTCAGTAACTTCTTACCATATAAAAAC 
GATAATTGCTTTATTTGGAAAAGAATTTAGGAATACTAAGGACAATTATTTTTATAGACAAA 
GTAAAAAGACAGATATTTAAGAGGCATAACCAAAAAAGCAAAACTTGTAAACAGAGTAAAA^ 
TCTTTAATATTTCTAAAGACATACTGTTTATCTGCTTCATATGCTTTTTTTAATTTCACTAT 
T C CAT T T C TAAAT T AAAG T T AT GCTAAAT T GAGTAAGC TG T T TAT C AC TTAACAGCT C AT T T 
TGTCTTTTT CAAT AT AC AAAT T T TAAAAATAC TACAAT ATT TAAC T AAGGCC CAACC G AT T T 
C CAT AAT G TAG CAG T TAC C G T G T TCACC T CACACTAAGGC C T AGAG TT T GC TCT GATAT GC A 
TTTGGATGATTAATGTTATGCTGTTCTTTCATGTGAATGTCAAGACATGGAGGGTGTTTGTA 
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ATTTTATGGTAAAATTAATCCTTCTTACAC^^ 

CACTTACAATTGTATGTCTCCTCAAATGAAGATTCTTTATGTGAAATTTTAAAAGACATTGA 

TTCCGCATGTAAGGATTTTTCATCTGAAGTACAATAATGCACAATCAGTGTTGCTCAAACTG 

CTTTATACTTATAAACAGCCATCTTAAATAAGCAACGTATTGTGAGTACTGATATGTATATA 
ATAAAAAT TAT CAAAGGAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs,min/ss .DNA52196 
xsubunit 1 of 1, 229 aa, 1 stop 
XMW: 26017, pi: 4.73, NX(S/T): 0 

MGDKIWLPFPVLLLAALPPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEIEY 
QVLDGAGLD I DFHLAS PE GKTLVFEQRKSDGVHTVETEVGDYMFCFDNT FST I SEKVI FFEL 
ILDNMGEQAQE QEDWKKYI TGTDILDMKLEDILES INS IKSRLSKSGHIQIiLRAFEARDRN 
I QESNFDRVN FW SMVNL WMWVS AI QVYMLKS L FE DKRKS RT 

Important features: 
Signal peptide: 

amino acids 1-2 3 

Transmembrane domain: 

amino acids 195-217 

N-myristoylation site. 

amino acids 43-48 

Tyrosine kinase phosphorylation site, 
amino acids 55-62 
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C CAT C C C T GAGAT C T T T T T ATAAAAAAC C CAGT C TT T GC T GAC CAGACAAAGCATAC CAGAT 

CT.CACCAGAGAGTCGCAGACACTMSCTGCCTCCCATGGCCCTGCCCAGTGTGTCCTGGATG 
CTGCTTTCCTGCCTCATTCTCCTGTGTCAGGTTCAAGGTGAAGAAACCCAGAAGGAACTGCC 
CTCTCCACGGATCAGCTGTCCCAAAGGCTCCAAGGCCTATGGCTCCCCCTGCTATGCCTTGT 
TTTTGTCACCAAAATCCTGGATGGATGCAGATCTGGCTTGCCAGAAGCGGCCCTCTGGAAAA 
CTGGTGTCTGTGCTCAGTGGGGCTGAGGGATCCTTCGTGTCCTCCCTGGTGAGGAGCATTAG 
TAAC AG C TAG T CAT AC AT C T GGAT T GGGC T C CAT GAC C C CAC AC AGGGC T CT GAGC C T GAT G 

GAGAT GGAT GGGAGTGGAGTAGCACTGATGTGATGAATTACTTTGCATGGGAGAAAAATCCC 
TCCACCATCTTAAACCCTGGCCACTGTGGGAGCCTGTCAAGAAGCACAGGATTTCTGAAGTG 
GAAAGATTATAACTGTGATGCAAAGTTACCCTATGTCTGCAAGTTCAAGGACiaSGGCAGGT 
GGGAAGTCAGCAGCCTCAGCTTGGCGTGCAGCTCATCATGGACATGAGACCAGTGTGAAGAC 
TCACCCTGGAAGAGAATATTCTCCCCAAACTGCCCTACCTGACTACCTTGTCATGATCCTCC 
TTCTTTTTCCTTTTTCTTCACCTTCATTTCAGGCTTTTCTCTGTCTTCCATGTCTTGAGATC 
TCAGAGAATAATAATAAAAAT GT TAC TTTATAAAAAAAAAAAAAAAAAAAAAA 



AVrf&h 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss • DNA56965 
<subunit 1 of 1, 175 aa, 1 stop 
<MW; 19330, pi: 7.25, NX(S/T): 1 

MLPPHALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKSWM 
DADLACQKRP S GKL VS VLS GAEGS FVS SLVKS I3NS YS YI WI GLHDFTQGSEPDGDGWEWS 3 
TDVMNYFAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD 

Important features: 
Signal peptide: 

amino acids 1-26 

C-type lectin domain signature. 

amino acids 14 6-171 
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CCAGTCTGTCGCCACCTCACTTGGTGTCTGCTGTCCCCGCCAGGCAAGCCTGGGGTGAGAGC 
ACAGAGGAGTGGGCCGGGACCaJBaCGGGGGACGCGGCTGGCGCTCCTGGCGCTGGTGCTGGC 
TGCCTGCGGAGAGCTGGCGCCGGCCCTGCGCTGCTACGTCTGTCCGGAGCCCACAGGAGTGT 
CGGACTGTGTCACCATCGCCACCTGCACCACCAACGAAACCATGTGCAAGACCACACTCTAC 
TCCCGGGAGATAGTGTACCCCTTCCAGGGGGACTCCACGGTGACCAAGTCCTGTGCCAGCAA 
GTGTAAGCCCTCGGATGTGGATGGCATCGGCCAGACCCTGCCCGTGTCCTGCTGCAATACTG 
AGCTGTGCAATGTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTC 

CTCCCACTCTTGAGCCTCCGACTGIftfiAGTCCCCGCCCACCCCCATGGCCCTATGCGGCCCA 
GCCCCGAATGCCTTGAAGAA.GTGCCCCCTGCACCAGGAAAAAAAAAAAAAAAAA 
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FIGURE 185 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56405 
<subunit 1 of 1, 125 aa, 1 step 
<MW: 13115, pi: 5.90, NX(S/T): 1 
MRGTRIAXLALVLAACGELAPALRCYV^ 

FQGD S T VTXS CAS KCKP S DVDG I GQTL P VS CCNTELCNVDGAPALKS LHC GAL TLL PLLS LRL 

Important features: 
Signal peptide: 

amino acids 1-17 

N-glycosylation site. 

amino acids 4 6-49 

r 
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FIGURE 186 

CTGCAGTCAGGACTCTGGGACCGCAGGGGGCTCCCGGACCCTGACTCTGOVGCCGAACCGGC 
ACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTT 
GAGTCCTTCTGAGA2SATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGG 
TAGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTC 
AATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTC 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
AC TAC C AG C C G TAG C C G T G C GCAGAGGAC GAGGAG T G C GGC AC TGAT GAGTAC T GC G C TAGT 

CCCACCCGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACG 

CTGCATGCGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTT 

CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 

CATAGCACCTTGGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAA 

AGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTA 

GACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCAT 
AGGAG AAAA.GG C T C T C AT G GAC T AGAAAT AT T C CAG C G T T G T TAC T GTGGAGAAGG T C T GTC 

TTGCCGGATACAGAAAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGA 
GAC AC3&&AC CAG C TAT C C AAATGCAGT GAAC T C C T T T TAT ATAAT AGAT GC TAT GAAAAC C 

TTTTATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCAT 
TCCAATAACACCTTCCAAAAACCTGGAGTGTAA.GAGCTTTGTTTCTTTATGGAA.CTCCCCTG 
TGATTGCAGTA^TTACTGTATTGTA^TTCTCAGTGTGGCACTTACCTGTAAATGCAATGA 
AACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTT 
TTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATT 
T CAG C T TAT AG T T C T T AAAAGC ATAACC C T T TAC C C CAT T T AAT T C TAGAG T C T AGAA.C GCA 
AGGAT C T C T T GGAAT G AC AAAT GAT AGG TAC C T AAAA.T G TAACAT GAAAAT AC TAGC T TAT T 

TTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGA 
AATAAAATTTAACATTTAAAAAAA&AAAAA 
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FIGURE 187 

</usr/ seqdb2/sst/DNA/Dnaseqs .min/ss .DNA57530 
<subunit 1- of 1, 2 66 aa, 1 stop 
<MW: 23672, pi: 8.35, NX(S/T): 1 
lyQffiLGAAGATRVFVAMVA^ 

APGILYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGG^^^ 

AMCCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYHTKGQEG 
SVCLRSSDCASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLEIFQRCYCGEGLSCRIQ 
KDHHQASNSSRLHTCQRH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 256-259 

Fungal Zn (2) -Cys (6) binuclear cluster domain 

amino acids 110-126 
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FIGURE 188 

TGTGTTTCCCTGCAGTCAGAATTTGGGACNGCAGGGGTTCCCGGACCTGATTTTGCAGCGGA 
ACGGGAAGGTTTTGTGGGACCCAGGTTGAAA.TGACGGTCATTTTTTTTTCTTTCTCCTTCNG 
GAGTCCTTNTGAGANGATGGTTTTGGGCGCAGCGGGAGCTAACCCGGTTTTTTGTNGCGATG 
GTAGCGGCGGTTTTCGGCGGCCACCTTNTGCTGGGAGTGAGCGCCACCTTGAATCGGTTTTC 
AATTCCAACGNTATCAAGAACCTGCCCCCACCGNTGGGCGGCGCTGCGGGGCACCCAGGNTT 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ATT AC CAGCCGT ACCC G T G C GCAGAGGAC GAGGAGTGCGGCACTGAT GAGTACTGCGCTAGT 

CCCACCCGCGGAGGGGANGCGGGCGTGCAAATNTGTNTNGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCANGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTNTT 
C T GAT C AAAAT C AT T T C C G AGGAGAAAT T GAGGAAAC CAT C AC T GAAAGC T T T GG TAAT GAT 
CATAGCACCTTGGATGGG 
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FIGURE 189A 



GAGGAACCTACCGGTACCGGCCGCGCGCTGGTAGTCGCCGGTGTGGCTGCACCTCACCAATC 
CCGTGCGCCGCGGCTGGGCCGTCGGAGAGTGCGTGTGCTTCTCTCCTGCACGCGGTGCTTGG 
GCTCGGCCAGGCGGGGTCCGCCGCCAGGGTTTGAGGATGGGGGAGTAGCTACAGGAAGCGAC 
CCCGC GAT GGCAAGGT AT AT TTTTGT GGAAT GAAAAGGAAGTATTAGAAATGAGCTGAAGAC 
CATTCACAGATTAATATTTTTGGGGACAGATTTGTGATGCTTGATTCACCCTTGAAGTAATG 
T AGAC AGAAG T T C T C AAAT T T GC ATATTACAT C AAC T GGAAC C AGCAGT GAAT CTTAAT GTT 
C AC T TAAAT CAGAAC TTG C ATAAGAAAGAGAAI£GGAGT CTGGT T AAAT AAAG AT GACTATA 
T CAGAGAC T T GAAAAGGAT CAT TCTCTGTTTTCT GATAGT GTATAT GGC CAT T T TAGT GGGC 
ACAGATCAGGATTTTTACAGTTTACTTGGAGTGTCCAAAACTGCAA.GCAGTAGAGAAATAAG 
ACAAGCTTTCAAGAAA.TTGG(^TTGAAGTTACATCCTGATAAAAACCCGAATAACCCAAATG 
CAC AT GGCGAT TT T T TAAAAATAAAT AGAGC ATAT G AAG T AC T C AAAG AT GAAGATCTACGG 
AAAAAGTATGACAAATATGGAGAAAAGGGACTTGAGGATAATCAAGGTGGCCAGTATGAAAG 
CTGGAACTATTATCGTTATGATTTTGGTATTTATGATGATGATCCTGAAATCATAACATTGG 
AAAGAAGAGAAT T T GAT GC T GC TGT TAAT T C T GGAGAAC T GTGGTT T GTAAAT T T TTACT C C 
C CAGG C T GT T CAC AC T G C CAT GATT TAGCT CC CAC AT GGAGAGACTT T GCTAAAGAAGT GGA 
TGGGTTACTTCGAATTGGAGCTGTTAACTGTGGTGATGATAGAATGCTTTGCCGAATGAAAG 
GAGTCAACAGCTATCCCAGTCTCTTCATTTTTCGGTCTGGAATGGCCCCAGTGAAATATCAT 
GGAGACAGAT CAAAGGAGAG T TTAGTGAGTTT TGCAATGCAGCATGTTAGAAGTACAGTGAC 
AGAACTTTGGACAGGAAATTTTGTCAACTCCATACAAACTGCTTTTGCTGCTGGTATTGGCT 
GGC T GAT CAC TTTTTGTT C AAAAGGAGG AGAT T GT T T GAC T T C ACAGAC ACG AC TC AGGCT T 
AG T GGC AT GTTGTTTCT C AAC T CAT T GGAT GC TAAAGAAATAT AT T T GGAAGTAAT ACATAA 
TCTTCCAGATTTTGAACTACTTTCGGCAAACACACTAGAGGATCGTTTGGCTCATCATCGGT 
GGCTGTTATTTTTTCATTTTGGAAAAAATGAAAATTCAAATGATCCTGAGCTGAAAAAACTA 
AA?UVCTCTACTTAAAAATGATCATATTCAAGTTGGCAGGTTTGACTGTTCCTCTGCACCAGA 
CATCTGTAGTAATCTGTATGTTTTTCAGCCGTCTCTAGCAGTATTTAAAGGACAAGGAACCA 
AAGAATATGAAATTCATCATGGAAAGAAGATTCTATATGATATACTTGCCTTTGCCAAAGAA 
AGTGTGAATTCTCATGTTACCACGCTTGGACCTCAAAATTTTCCTGCCAATGACAAAGAACC 
ATGGCTTGTTGATTTCTTTGCCCCCTGGTGTCCACCATGTCGAGCTTTACTACCAGAGTTAC 
GAAG AGCAT C AAAT C T T C T T TAT GG T CAGC T TAAG T T T GGT ACAC TAGAT T G TACAG T T CAT 
GAGGGAC T C T G T AAC AT G T ATAAC AT T C AGGC T TAT C CAACAACAGT GGT AT T C AAC C AG T C 
CAACAT T CAT GAG TAT G AAGGAC AT CAC T C T GCT GAACAAAT CT T GGAG T T C AT AG AGGAT C 
TTATGAATCCTTCAGTGGTCTCCCTTACACCCACCACCTTCAACGAA.CTAGTTACACAAAGA 
AAACACAACGAAGTCTGGATGGTTGATTTCTATTCTCCGTGGTGTCATCCTTGCCAAGTCTT 
AATGCCAGAATGGAAAAGAATGGCCCGGACATTAACTGGACTGATCAACGTGGGCAGTATAG 
ATTGCCAACAGTATCATTCTTTTTGTGCCCAGGAAAACGTTCAAAGATACCCTGAGATAAGA 
TTTTTTCCCCCAAAATCAAATAAAGCTTATCAGTATCACAGTTACAATGGTTGGAATAGGGA 
T GC T TAT T C C C T GAGAAT C T GGGGT C T AGGAT T TT TAC C T CAAGTAT C CACAGAT C TAACAC 
CTCAGACTTTCAGTGAAAAAGTTCTACAAGGGAAAAATCATTGGGTGATTGATTTCTATGCT 
CCTTGGTGTGGACCTTGCCAGAATTTTGCTCCAGAATTTGAGCTCTTGGCTAGGATGATTAA 
AGGAAAAG T G AAAG C T GG AAAAGTAGACT G T CAGGCTT AT GC TC AG AC AT GCCAGAAAGCT G 
GGATCAGGGCCTATCCAACTGTTAAGTTTTATTTCTACGAAAGAGCAAAGAGAAATTTTCAA 
GAAGAGCAGATAAATACCAGAGATGCAAAAGCAATCGCTGCCTTAATAAGTGAAAAATTGGA 
AACTCTCCGAAATCAAGGCAAGAGGAATAAGGATGAACTTISSlTAATGTTGAAGATGAAGAA 
AAAG T T TAAAAGAAAT T C T G ACAGAT GACAT C AGAAGACAC C TAT T TAGAAT G T TACATT TA 
TGATGGGAATGAATGAACATTATCTTAGACTTGCAGTTGTACTGCCAGAATTATCTACAGCA 
CTGGTGTAAAAGAAGGGTCTGCAAACTTTTTCTGTAAAGGGCCGGTTTATAAATATTTTAGA 
CTTTGCAGGCTATAATATATGGTTCACACATGAGAACAAGAATAGAGTCATCATGTATTCTT 
T GT TAT T T G C T T T T AAC AAC C TTTAAAAAATAT TAAAAC GAT T C TTAGC T C AGAGCCATACA 
AAAGTAGGCTGGATTCAGTCCATGGACCATAGATTGCTGTCCCCCTCGACGGACTTATAATG 
T TT C AGG T GGCTGGCTT G AACAT GAG T C T GC T G T GC TAT C T ACATAAATGT CTAAGT T GTAT 
AAAGTCCACTTTCCCTTCACGTTTTTTGGCTGACCTGAAAAGAGGTAACTTAGTTTTTGGTC 
ACT T G T T C T C C T AAAAAT G C TATC C C TAAC CATATATT TATAT T TCG T T TTAAAAACACC C A 
T GAT G T GG C AC AGT AAAC AAAC C C TGTTAT GCT GTATTATTAT GAGGAGAT TC TT CAT T GT T 
TTCTTTCCTTCTCAAAGGTTGAAAAAATGCTTTTAATTTTTCACAGCCGAGAAACAGTGCAG 
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CAG T ATAT GTGCACACAGTAAGTACACAAAT TT GAGCAACAGTAAGTGCACAAATTCTGTAG 
T T T GC T G TAT CAT C C AG G AAAACC T GAGGGAAAAAAAT TAT AGCAATTAACT GGGCATT GTA 

GAGTATCCTAAATATGTTATCAAGTATTTAGAGTTCTATATTTTAAAGATATATGTGTTCAT 
G TAT T T T C T GAAAT T G CT T T CATAGAAATT T T C C CACT GAT AGTT GAT TT TT GAGGCAT CTA 

ATATTTACATATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTT 
TTCTTTCATAGTTTTGGTTTTTCACTCCTGTCCAGTCTATTTATTATTCAAATAGGAAAAAT 
TACTTTACAGGTTGTTTTACTGTAGCTTATAATGATACTGTAGTTATTCCAGTTACTAGTTT 
ACTGTCAGAGGGCTGCCTTTTTCAGATAAATATTGACATAATAACTGAAGTTATTTTTATAA 
GAAAATCAAGTATATAAATCTAGGAAAGGGATCTTCTAGTTTCTGTGTTGTTTAGACTCAAA 
GAATCACAAATTTGTCAGTAAC\TGTAGTTGTTTAGTTATAATTCAGAGTGTACAGAATGGT 
AAAAAT T C C AAT CAG T C AAAAGAGG T CAAT GAAT TAAAAGGCT T GC AACT T T T T CAAAAAAA 
AAAAAAAAAA 
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FIGUR E 19 0 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56439 
<subunit I of 1, 747- aa, 1 stop 
<MW: 86127, pi: 7.46, NX(S/T): 2 

MGVWLNKDDYIRDLKRIILCFL^ 

HPDKNPNNPNAHGDFLKINRAYEVLCT^ 

YDDDPEIITLERREFDAAVNSGELWFVNFYSPGC^ 

GDDRMLCRMKGWS YP S L FT FRSGMAPVKYHGDRSKES LVS FAMQHVRSTVTELWTGNFVNS 
IQTAFAA.GIGWLITFCSKGGDCLTSQTRLRLSGMLFLNSLDAKEIYLEVIHNLPDFELLSAN 
TLEDRLAHHRWLLFFHFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSAPDICSNLYVFQP 
SLAVFKGQGTKEYEIHHGKKILYDIIAFAKESW 

PPCRALLPELRRASNLLYGQLKFGTLDCTVHEGLCNMYNIQAYPTTWFNQSNIHEYEGHHS 
AEQILEFIEDLMNPSWSLTPTTFNELVTQR^ 

LTGLINVGSIDCQQYHSFCAQENVQRYPEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLG 
FLPQVSTDLTPQTFSEKVLQGKNHWIDFYAPW 

QAYAQTCQKAGIRAYPTVKFYFYERAKRNFQEEQIOT^ 
Important features: 

Endoplasmic reticulum targeting sequence . 

amino acids 744-747 

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature. 

amino acids 77-96 

N-glycosylation site, 
amino acids 484-487 
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FIGURE 191 

AGACAGTACCTCCTCCCTAGGACTACACAA.GGACTGAACCAGAAGGAAGAGGACAGAGCAAA 

GCCMSAACATCATCCTAGAAATCCTTCTGCTTCTGATCACCATCATCTACTCCTACTTGGA 

GTCGTTGGTGAAGTTTTTCATTCCTCAGAGGAGAAAATCTGTGGCTGGGGAGATTGTTCTCA 

TTACTGGAGCTGGGCATGGAATAGGCAGGCAGACTACTTATGAATTTGCAAAACGACAGAGC 

ATATTGGTTCTGTGGGATATTAATAAGCGCGGTGTGGAGGAAACTGCAGCTGAGTGCCGAAA 

ACTAGGCGTCACTGCGCATGCGTATGTGGTAGACTGCAGCAACAGAGAAGAGATCTATCGCT 

CTCTAAATCAGGTGAAGAAAGAAGTGGGTGATGTAACAATCGTGGTGAATAATGCTGGGACA 

GTATATCCAGCCGATCTTCTCAGCACCAAGGATGAAGAGATTACCAAGACATTTGAGGTCAA 

CATCCTAGGACATTTTTGGATCACAAAAGCACTTCTTCCATCGATGATGGAGAGAAATCATG 

GCCACATCGTCACAGTGGCTTCAGTGTGCGGCCACGAAGGGATTCCTTACCTCATCCCATAT 

TGTTCCAGCAAATTTGCCGCTGTTGGCTTTCACAGAGGTCTGACATCAGAACTTCAGGCCTT 

GGGAAAAACTGGTATCAAAACCTCATGTCTCTGCCCAGTTTTTGTGAATACTGGGTTCACCA 

AAAATCCAAGCACAAGATTATGGCCTGTATTGGAGACAGATGAAGTCGTAAGAAGTCTGATA 

GATGGAATACTTACCAATAAGAAAATGATTTTTGTTCCATCGTATATCAATATCTTTCTGAG 

ACTACAGAAGTTTCTTCCTGAACGCGCCTCAGCGATTTTAAATCGTATGCAGAATATTCAAT 

TTGAAGCAGTGGTTGGCCACAAAATCAAAATGAAAUSAATAAATAAGCTCCAGCCAGAGATG 

TATGCATGATAATGATATGAATAGTTTCGAATCAATGCTGCAAAGCTTTATTTCACATTTTT 

TCAGTCCTGATAATATTAAAAACATTGGTTTGGCACTAGCAGCAGTCAAACGAACAAGATTA 

ATTACCTGTCTTCCTGTTTCTCAAGAATATTTACGTAGTTTTTCATAGGTCTGTTTTTCCTT 

TCATGCCTCTTAAAAACTTCTGTGCTTACATAAACATACTTAAAAGGTTTTCTTTAAGATAT 

T T TAT T T T T C CAT T TAAAGG T GGAC AAAAGC TAC CT CC CTAAAAG T AAAT AC AAAGAGAAC T 

TATTTACACAGGGAAGGTTTAAGACTGTTCAAGTAGCATTCCAATCTGTAGCCATGCCACAG 

AATATCAACAAGAACACAGAATGAGTGCACAGCTAAGAGATCAAGTTTCAGCAGGCAGCTTT 

ATCTCAACCTGGACATATTTTAAGATTCAGCATTTGAAAGATTTCCCTAGCCTCTTCCTTTT 

TCATTAGCCCAAAACGGTGCAACTCTATTCTGGACTTTATTACTTGATTCTGTCTTCTGTAT 

AACTCTGAAGTCCACCAAAAGTGGACCCTCTATATTTCCTCCCTTTTTATAGTCTTATAAGA 

TACATTATGAAAGGTGACCGACTCTATTTTAAATCTCAGAATTTTAAGTTCTAGCCCCATGA 

TAACCTTTTTCTTTGTAATTTATGCTTTCATATATCCTTGGTCCCAGAGATGTTTAGACAAT 

TTTAGGCTCAAAAATTAAAGCTAACACAGGAAAAGGAACTGTACTGGCTATTACATAAGAAA 
CAATGGACCCAAGAGAAGAA 
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FIGURE 192 

</usr/seqdb2/sst/DNA/Dnaseqs.xain/ss -DNA56409 
<subunit 1 of 1, 300 aa, 1 stop 
<MW: 33655, pi: 9.31, NX(S/T): 1 

MNI I LE I LLLL I T 1 1 YS YLE S LVKFFI PQRRKSVAGE IVL I TGAGHGI GRQTT YEFAKRQS I 
LVL WD INKRGVEE TAAE CRKLGVTAHAYWDCSNREE I YRSLNQVKKEVGDVT I WNNAGTV 
YPADLLSTKDEEITKTFEVNILGHFWITKALLPS 

SSKFAAVGFHRGLTSELQALGKTGIKTSCLCP T /FVNTGFTKNPSTRLWPVLETDEVVRSLID 
GIL TNKKM I FVP S Y IN I FLRLQKFL PERAS AI LNRMQN I Q FE AWGHK I KMK 

Important features : 
Signal peptide: 

amino acids 1-19 

cAMP- and cGMP -dependent protein kinase phosphorylation site, 
amino acids 30-33 and 58-61 

Short-chain alcohol dehydrogenase family protein 
amino acids 165-202, 37-49, 112-122 and 210-219 
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FIGURE 193 

CGGCGGCGGCTGCGGGCGCGAGGTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGG 
ATGCCCCGGCTCTGCAGGAAGCTGflAGTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGC 
AGGMS^CCAAGGCCCGGCTGTTCCGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCT 
GC T GAT CAT C G T G T AC T GGG AC AGC GCAGGC G C C GCGCAC T T C TAC TTGCACAC G T C CTT CT 
C T AGG C C GCAC AC GGGGCCGCCGCTGCC CACCC C C GGGC C G GAC AGGGACAGGGAGC T CAC G 
GCCGACTCCGATGTCGACGAGTTTCTGGACAAGTTTCTCAGTGCTGGCGTGAAGCAGAGCGA 
CCTTCCCAGAAAGGAGACGGAGCAGCCGCCTGCGCCGGGGAGCATGGAGGAGAGCGTGAGAG 
GCTACGACTGGTCCCCGCGCGACGCCCGGCGCAGCCCAGACCAGGGCCGGCAGCAGGCGGAG 
CGGAGGAGCGTGCTGCGGGGCTTCTGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCG 
CGCATTCGACGACATCCCCAACTCGGAGCTGAGCCACCTGATCGTGGACGACCGGCACGGGG 
CCATCTACTGCTACGTGCCCAAGGTGGCCTGCACCAACTGGAAGCGCGTGATGATCGTGCTG 
AGCGGAAGCCTGCTGCACCGCGGTGCGCCCTACCGCGACCCGCTGCGCATCCCGCGCGAGCA 
CGTGCACAACGCCAGCGCGCACCTGACCTTCAACAAGTTCTGGCGCCGCTACGGGAAGCTCT 
CCCGCCACCTCATGAAGGTCAAGCTCAAGAAGTACACCAAGTTCCTCTTCGTGCGCGACCCC 
TTCGTGCGCCTGATCTCCGCCTTCCGCAGCAAGTTCGAGCTGGAGAACGAGGAGTTCTACCG 
CAAGTTCGCCGTGCCCATGCTGCGGCTGTACGCCAACCACACCAGCCTGCCCGCCTCGGCGC 
GCGAGGCCTTCCGCGCTGGCCTCAAGGTGTCCTTCGCCAACTTCATCCAGTACCTGCTGGAC 
CCGCACACGGAGAAGCTGGCGCCCTTCAACGAGCACTGGCGGCAGGTGTACCGCCTCTGCCA 
C C C G T G C C AGAT C GAC TAC GACT T CGT GGGGAAGC T GGAGAC T C T GGAC GAGGAC GC C GC GC 
AGCTGCTGCAGCTACTCCAGGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGAACAGG 
AC C GC C AGC AGC T G G G AGGAGGACT GG T T CGC C AAGAT CCC C CTGGC C TGGAGGC AGCAGCT 
GTATAAACTCTACGAGGCCGACTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCC 
GAGACIS^AAGCTTTCGCGTTGCTTTTTCTCGCGTGCCTGGAACCTGACGCACGCGCACTCC 
AGTTTTTTTATGACCTACGATTTTGCAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCC 
AT T GAGTAC T GTAT CGAT AT T GT T T TT TAAGAT T AATATAT T T CAGG TAT T TAATAC GA 
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FIGURE 194 



</usr/seqdb2/sst/BNA/Dnaseqs .min/ss . DNA56112 
<subunit 1 of 1, 414 aa, 1 stop 
<MW: 48414, pi: 9.54, NX(S/T): 4 

HTKARLFRLWLVLGSVFMILLIIVYWDSAGAAHFYLKTSFSRPHTGPPLPTPGPDRDRELTA 

DSDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAER 
RSVLRGFCANSSLAFPTKERAFDDIPNSELSHL 

GSLLHRGAPYRDPLRIPREHVHNASAHLTFNKFWRRYGK^ 

VRL I S AFRSKFELENEE FYRKFAVPMLRL YANHTS LPASAREAFRAGLKVS FANFI Q YLLDP 

HTEKLAPFNEHWRQVYRLCHPCQILYDFVGKLETLDEDAAQLLQLLQVDRQLRFPPSYRNRT 
ASSWEEDWFAKIPLAWRQQLYKLYEADFVLFGYPKPENLLRD . 

Important features: 
Signal peptide: 

amino acids 1-31 

N-glycosylation sites, 

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGFR family cysteine-rich region protein 

amino acids 329-332 
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FIGURE 195 

TCGGGCCAGAATTCGGCACGAGGCGGCACGAGGGCGACGGCCTCACGGGGCTTTGGAGGTGA 
AAGAGGC C C AGAGTAGAGAGAGAGAGAGACC GACGTACAC GGG&ISGCTACGGGAAC GC GCT 

ATGCCGGGAAGGTGGTGGTCGTGACCGGGGGCGGGCGCGGCATCGGAGCTGGGATCGTGCGC 
GCCTTCGTGAACAGCGGGGCCCGAGTGGTTATCTGCGACAAGGATGAGTCTGGGGGCCGGGC 
CCTGGAGCAGGAGCTCCCTGGAGCTGTCTTTATCCTCTGTGATGTGACTCAGGAAGATGATG 
TGAAGACCCTGGTTTCTGAGACCATCCGCCGATTTGGCCGCCTGGATTGTGTTGTCAACAAC 
GCTGGCCACCACCCACCCCCACAGAGGCCTGAGGAGACCTCTGCCCAGGGATTCCGCCAGCT 
GCTGGAGCTGAACCTACTGGGGACGTACACCTTGACCAAGCTCGCCCTCCCCTACCTGCGGA 
AGAGTCAAGGGAATGTCATCAACATCTCCAGCCTGGTGGGGGCAATCGGCCAGGCCCAGGGA 
GTTCCCTATGTGGCCACCAAGGGGGCAGTAACAGCCATGACCAAAGCTTTGGCCCTGGATGA 
AAGTCCATATGGTGTCCGAGTCAACTGTATCTCCCCAGGAAACATCTGGACCCCGCTGTGGG 
AGGAGC T G GC AGC C T T AAT G C CAGAC C C TAGGGC CACAAT C C GAGAGGGCAT GC T GGC C C AG 

CCACTGGGCCGCATGGGCCAGCCCGCTGAGGTCGGGGCTGCGGCAGTGTTCCTGGCCTCCGA 
AGCCAACTTCTGCACGGGCATTGAACTGCTCGTGACGGGGGGTGCAGAGCTGGGGTACGGGT 
GCAAGGCCAGTCGGAGCACCCCCGTGGACGCCCCCGATATCCCTTCCIS^TTTCTCTCATTT 
CTACTTGGGGCCCCCTTCCTAGGACTCTCCCACCCCAAACTCCAACCTGTATCAGATGCAGC 
CCCCAAGCCCTTAGACTCTAAGCCCAGTTAGCAAGGTGCCGGGTCACCCTGCAGGTTCCCAT 
AAAAACGATTTGCAGCC 
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FIGURE 196 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56045 
<subunit 1 of 1, 270 aa, 1 stop 
<MW: 28317, pi: 6.00, NX(S/T): 1 

MATGTRYAGKVVWTGGGRGIGAGIVRAF^/NSGARWICDKDESGGRALEQELPGAVFILCD 
VTQEDDVKTLVSETIRRFGRLDCVVNNAGHHPPPQRPEETSAQGFRQLLELNLLGTYTLTKL 
ALP YLRKS QGNVIN I S S LVGAI GQAQAVP YVATKGAVTAMTKALALDES PYGVRVNC I S PGN 
IWTPLWEELAALMPDPRAT I REGMLAQPLGRMGQPAEVGAAAVFLASEANFCTGIELLVTGG 
AELGYGCKASRSTPVDAPDIPS 

Important features: 
N-glycosylation site. 

amino acids 138-141 

Short-chain alcohol dehydrogenase family protein 

amino acids 10-22, 81-91, 134-171 and 176-185 
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FIGURE 197 



AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGaAISGACTGGCCTCACAACCTG 
CTGTTTCTTCTTA.CCATTTCCATCTTCCTGGGGCTGGGCCAGCCCAGGAGCCCCAAAAGCAA 
GAGGAAGGGGCAAGGGCGGCCTGGGCCCCTGGCCCCTGGCCCTCACCAGGTGCCACTGGACC 
T G G T G T C AC GG AT GAAAC C G TAT GCC C GCAT G GAGGAG TAT GAGAGGAACAT C GAGGAGAT G 
GT GGC C CAGCTGAGGAACAGCTCAGAGC TGGC CCAGAGAAAGT GTGAGGTCAAC T T GCAGCT 
G T GGA T G T C CAAC AAGAG G AGC CTGTCTCCCT GGGGC TACAG CAT CAAC CAC GAC C C CAGCC 
GTATCCCCGTGGACCTGCCGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACC 
ATGCAGGAGGACCGCAGCATGGTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCG 
CCTCTGCCCGCCACCGCCCCGCACAGGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCG 
CTGTGGGCTGCACCTGCATCTTCTSAATCACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGA 
CCATCCTCCTTGCACCTTTGTGCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAA 
GCAAG 
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FIGURE 198 



</usr/seqdb2/sst/DNA/Dnaseqs •iain/ss .DNA59294 
<subunit 1 of 1, 130 aa, 1 stop 
<MW: 20437, pi: 9.58, NX (S/T) : 1 

MDWPHNLLFLLT I S I FLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEY 
ERNIEEHVAQLRNSSELAQRKCEWLQLWMSNKRSLSPWGYSINHDPSRIPVDLPEARCLCL 
GCVNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 75-7,8 

Homologous region to IL-17 

amino acids 96-180. 
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FIGURE 199 

GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCGGG 
CGAGCGAGGCTGCGGGCCGGCCGCTGCCCTTCCCCACACTCCCCGCCGAGAAGCCTCGCTCG 
GCGCCCAACAISGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGAT 
CGCGGCTGTGGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCAGAGCCGGG 
TCCAGCCCATGACCGCCTCCAACTGGACGCTGGTGATGGAGGGCGAGTGGATGCTGAAATTT 
TACGCCCCATGGTGTCCATCCTGCCAGCAGACTGATTCAGAATGGGAGGCTTTTGCAAAGAA 
TGGTGAAATACTTCAGATCAGTGTGGGGAAGGTAGATGTCATTCAAGAACCAGGTTTGAGTG 
GCCGCTTCTTTGTCACCACTCTCCCAGCATTTTTTCATGCAAAGGATGGGATATTCCGCCGT 
TAT CGTGGCC C AG GAAT C T T C GAAGAC C T GCAGAAT TATATCT T AGAGAAGAAAT G GCAAT C 

AGTCGAGCCTCTGACTGGCTGGAAATCCCCAGCTTCTCTAACGATGTCTGGAATGGCTGGTC 
T T T T T AGC AT C T C T G G C AAG AT AT GGC AT C T T C ACAAC TAT T T C ACAG T GAG T C T T GGAAT T 

CCTGCTTGGTGTTCTTATGTGTTTTTCGTCATAGCCACCTTGGTTTTTGGCCTTTTTATGGG 
TCTGGTCTTGGTGGTAATATCAGAATGTTTCTATGTGCCACTTCCAAGGCATTTATCTGAGC 
G T T CT GAG C AGAAT C GGAGATCAGAGGAGGCT CATAGAGC T GAACAGTT GCAGGAT GCGGAG 
GAGGAAAAAG AT G A T T CAAA. T GAAGAAGAAAAC AAAG ACAG C C T T GT AGAT GAT GAAGAAGA 
GAAAGAAGAT C T T GGC GAT G AGGAT GAAGC AGAGGAAGAAGAGGAGGAGGACAACT T GGCTG 
C T GG T G T G GAT GAGGAGAGAAGT GAGGC CAAT GATCAGGGGC C C C CAGGAGAGGAC GGTGTG 
ACCCGGGAGGAAGTAGAGCCTGAGGAGGCTGAAGAAGGCATCTCTGAGCAACCCTGCCCAGC 
T G AC AC AGAG G T G G T G GAAGAC T C C T T GAGGCAG C GT AAAAGT C AGC AT GC T GACAAGGGAC 
TGXBlSATTTAATGATGCGTTTTCAAGAATACACACCAAAACAATATGTCAGCTTCCCTTTGG 
CCTGCAGTTTGTACCAAA.TCCTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATGCT 
CTCTAGTCATTTGGTCTCATGGCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGT 
GAC AAT C AGGAT AT AG AAAAACAAAC G TAG T G T T GGGAT C T GT T T GGAGAC T GGGAT GGGAA. 
C AAG T T CAT T T AC T TAG G G G T C AGAGAG T C T C GAC C AGAGGAG GC CAT T C C C AG T C CT AAT C 
AGC AC C T T C C AG AGAC AAG G C T GC AGG C C C T G T GAAAT GAAAG C CAAGC AG GAGC C T T GGC T 
CCTGAGCATCCCCAAAGTGTA^CGTAGAAGCCTTGCATCCTTTTCTTGTGTAAAGTATTTAT 
T T T T G T C AAAT T G C AG G AAAC AT CAGGC AC C ACAG T G CAT GAAAAAT C T T T C ACAGC TAGAA 
ATTGAAAGGGCCTTGGGTATAGAGAGCAGCTCAGAAGTCATCCCAGCCCTCTGAATCTCCTG 
TGCTATGTTTTATTTCTTACCTTTAATTTTTCCAGCATTTCCACCATGGGCATTCAGGCTCT 
CCACACTCTTCACTATTATCTCTTGGTCAGAGGACTCCAATAACAGCCAGGTTTACATGAAC 
TGTGTTTGTTCATTCTGACCTAAGGGGTTTAGATAATCAGTAACCATAACCCCTGAAGCTGT 
GACTGCCAAACATCTCAAATGAAATGTTGTGGCCATCAGAGACTCAA?UVGGAAGTAAGGATT 
TTACAAGACAGATTAAAAAAAAATTGTTTTGTCCAAAATATAGTTGTTGTTGATTTTTTTTT 
AA.GTTTTCTAAGCAATATTTTTCAAGCCAGAAGTCCTCTAAGTCTTGCCAGTACAAGGTAGT 
CTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTCATCTCAAGGGGTTCCCTGGGTCTTGAAC 
TACTTTAATAATAACTAAAAAACCACTTCTGATTTTCCTTCAGTGATGTGCTTTTGGTGAAA. 
GAATTAATGAACTCCAGTACCTGAAAGTGAAAGATTTGATTTTGTTTCCATCTTCTGTAATC 
TTCCAAA.GAATTATATCTTTGTAAATCTCTCAATACTCAATCTACTGTAAGTACCCAGGGAG 
GCTAATTTCTTT 
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FIGURE 200 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56433 
<subunit 1 of 1, 349 aa, I stop 
<MW: 33952, pi: 4,34, NX(S/T): I 
MAGGRCGPQLTALLAAWJAAVAATAGPEEAALPPEQS 

WCPSCQQTDSEWEAFAKNGEILQISVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGIFRRYRG 
PGIFEDLQNYILEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLKNYFTVTLGIPAW 
CSYVFFVIATLVFGLFMGLVLWISECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEK 
DDSNEEENKDSLVDDEEEKEDLGDEDEAEEEEEEDNLAAGVDEERSEANDQGPPGEDGVTRE 
EVE PEEAEE G I S E Q P C PADTEWEDS LRQRKS QHADKGL 

Important features : 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 191-211 

N-gl yco sy 1 a ti on s i te , 

amino acids 4 6-49 

Thxoredoxin family proteins. (homologous region to disulfide 

isomerase) 

amino acids 56-72 

Flavodoxin proteins 

amino acids 173-187 
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FIGURE 201 

AT C T GGT T GAAC T AC T TAAGCT TAATT T GT TAAACT CCGGTAAGTAC C TAGCC CACATGAT T 
T GAC T C AGAGAT TCTCTTTTGTC C AC AGACAG T CAT C T C AGGG G C AG AAAG AAAAGAG C T C C 
CAAAT GC TAT AT C TAT T C AG G GG C T C T CAAGAACAATgGAATAT CAT C C T GAT T T AGAAAAT 
T T G G A T GAAGAT G GAT AT AC T CAAT T ACAC T T C GAC T C T C AAAG CAATAC CAGGATAGCT G T 
TGTTTCAGAGAAAGGATCGTGTGCTGCATCTCCTCCTTGGCGCCTCATTGCTGTAATTTTGG 

GAATCCTATGCTTGGTAATACTGGTGATAGCTGTGGTCCTGGGTACCATGGGGGTTCTTTCC 
AG CCCTTGTCCTCC T AA T T GGAT T ATAT AT GAGAAGAGC T G T TAT C TAT T CAG CAT G T CAC T 
AAAT T C C T G G GAT G G AAG T AAAAGAC AAT GC T GG CAAC T GGGC T C TAAT C T C C TAAAGAT AG 
AC AG C T CAAAT GAAT T G G GAT T TAT AG TAAAAC AAG TGTCTTCC CAAC C T GAT AAT TC ATT T 
T GGAT AGG C CTTTCTCGGCCC CAGAC T GAGGTAC CAT GGC T CT GGGAGGAT GGAT CAACAT T 
C T C T T C T AAC T TAT T T CAG AT C AGAAC CACAG C TAC C C AAGAAAAC C CAT C T C CAAAT T GT G 
TATGGATTCACGTGTCAGTCATTTATGACCAACTGTGTAGTGTGCCCTCATATAGTATTTGT 
GAG AAGAAG T T T T CAAT G I&AGAGGAAGGGT GGAGAAGGAGAGAGAAAT AT GT GAG G TAGTA 
AG GAG GAC AGAAAAC AGAAC AGAAAAGAG TAACAGC T GAGG T C AAGATAAAT GC AGAAAAT G 
T T T AGAG AGC T T G GC CAAC T G TAAT C T TAAC CAAGAAAT T GAAGG GAGAGGC T GT GAT T T C T 
G TAT T T G T C GAC C TAC AG G T AGGC TAG T ATT AT T T T T C TAG T TAG T AGAT C C C TAGAC AT GG 
AATCAGGGCAGCCAAGCTTGAGTTTTTATTTTTTATTTATTTATTTTTTTGAGATAGGGTCT 
CACTTTGTTACCCAGGCTGGAGTGCAGTGGCACAATCTCGACTCACTGCAGCTATCTCTCGC 
CTCAGCCCCTCAAGTAGCTGGGACTACAGGTGCATGCCACCATGCCAGGCTAATTTTTGGTG 
TTTTTTGTAGAGACTGGGTTTTGCCATGTTGACCAAGCTGGTCTCTAACTCCTGGGCTTAAG 
TGATCTGCCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCCACCACACCTGGC 
CCCAAGCTTGAATTTTCATTCTGCCATTGACTTGGCATTTACCTTGGGTAAGCCATAAGCGA 
ATCTTAATTTCTGGCTCTATCAGAGTTGTTTCATGCTCAACAATGCCATTGAAGTGCACGGT 
GTGTTGCCACGATTTGACCCTCAACTTCTAGCAGTATATCAGTTATGAACTGAGGGTGAAAT 
ATAT T T C T GAAT AGC T AAAT GAAGAAAT GGGAAAAAAT C T T CAC CACAG T C AGAGCAATT T T 

ATTATTTTCATCAGTATGATCATAATTATGATTATCATCTTAGTAAAAAGCAGGAACTCCTA 
CTTTTTCTT TAT CAAT T AAAT AG C T CAGAGAGTACAT C T GC C ATAT C T C T AATAGAAT C T T T 

TTTTTTTTTTTTTTTTTTTGAGACAGAGTTTCGCTCTTGTTGCCCAGGCTGGAGTGCAACGG 
CACGATCTCGGCTCACCGCAACCTCCGCCCCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCT 
C C C AAG TAG C T GGG AT TAC AG T CAG GC AC CAC CACAC C C GGC TAAT T T T GTAT T T T T T TAG T 

AGAGACAGGGTTTCTCCATGTCGGTCAGGGTAGTCCCGAACTCCTGACCTCAAGTGATCTGC 
CTGCCTCGGCCTCCCAAGTGCTGGGATTACAGGCGTGAGCCACTGCACCCAGCCTAGAATCT 
TGTATAATATGTAATTGTAGGGAAACTGCTCTCATAGGAAAGTTTTCTGCTTTTTAAATACA 
AAAATACATAAAAATACATAAAATCTGATGATGAATATAAAAAAGTAACCAACCTCATTGGA 
ACAAGTATTAACATTTTGGAATATGTTTTATTAGTTTTGTGATGTACTGTTTTACAATTTTT 
AC CAT T T T T T T C AG TAAT TACT G TAAAATGGTATTAT T GGAAT GAAAC TATATT T CC T CAT G 
TGCTGATTTGTCTTATTTTTTTCATACTTTCCCACTGGTGCTATTTTTA.TTTCCAATGGATA 
TTTCTGTATTACTAGGGAGGCATTTACAGTCCTCTAATGTTGATTAATATGTGAAAAGAAAT 
T G TAC CAAT T T TAC T AAAT TAT G CAG T T TAAAAT GGAT GAT T T TAT G T TAT G T GG AT T T CAT 
T T C AAT AAAAAAAAAC T C T TAT C AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 2Q 2 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss * DNA53912 
<subunit I of 1, 201 aa, 1 stop 
<MW: 22563, pi: 4.87, NX(S/T): 1 

MEYHPDLENLDEDGYTQLHFDSQSNTKIAWSEKGSCAASPPWRL2AVILGILCLVILVIAY 
VLGTMGYLSSPCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQ 

VSSQPDNSFWIGLSRPQTEVPWLWEDG5TFSSNLFQIRTTATQENPSPNCVWIEVSVIYDQL 
CSVPSYSICEKKFSM 

Important features : 

Type II transmembrane domain: 

amino acids 4 5-65 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 197-200 

N-myristoyiation sites . 

amino acids 35-40 and 151-156 

Homologous region to LDL receptor 

amino acids .34-67 and 70-200, 
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FIGURE 2Q3A 



GGAAGGGGAGGAGCAGGCCACACAGGCACAGGCCGGTGAGGGACCTGCCCAGACCTGGAGGG 

TCTCGCTCTGTCACACAGGCTGGAGTGCAGTGGTGTGATCTTGGCTCATCGTAACCTCCACC 

TCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGTGGTG^C 

TTCCAAGAGTGACTCCGTCGGAGGAAAA2GACTCCCCAGTCGCTGCTGCAGACGACACTGTT 

CCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGCCCACGGCAGGGGCCACAGGGAAGAC T TTC 
GCTTCTGCAGC C AG C GG AAC CAGACAC ACAGGAGCAGC CT CC AC TACAAAC C C AC AC GAG? C 

CTGCGCATCTCCATCGAGAACTCCGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCAGC 
CCACCCTGCTTCCCGATCCTTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGA 
ACCGACATGCTGGGAGATTACATCTTCTCTATGGCAAGCGTGACTTCTTGCTGAGTGACAAA 
GCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAGCCTGGCTCAGGGCCCCCCGCTGTT' 
AGCCACTTCTGTCACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCCGCCAGCT 1 
TCACCTTCTCCTTCCACAGTCCTCCCCACACGGCCGCTCACAATGCCTCGGTGGACATGTGC 
GAGCTCAAAAGGGACCTCCAGCTGCTCAGCCAGTTCCTGAAGCATCCCCAGAAGGCCTCAAG 
GAGGCCCTCGGCTGCCCCCGCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAAACTGACCTCTG 
TGAGATTCATGGGGGACATGGTGTCCTTCGAGGAGGACCGGATCAACGCCACGGTGTGGAAG 
C T C C AG C C C AC AG CCGGCCTC CAGGAC C T GCACAT CCAC T C C C GGCAG GAGG AGGAGC AGAG 

CGAGATCATGGAGTACTCGGTGCTGCTGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGGA 

GCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGAC 

AAGAATTCCAGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACACCAAAGT 

AGCCAACCTCACGGAGCCCGTGGTGCTCACTTTCCAGCACCAGCTACAGCCGAAGAATGTGA 

CTCTGCAATGTGTGTTCTGGGTTGAAGACCCCACATTGAGCAGCCCGGGGCATTGGAGCAGT 

GCTGGGTGTGAGACCGTCAGGAGAGAAACCCAAACATCCTGCTTCTGCAACCACTTGACCTA 

CTTTGCAGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCTGAGCC 

TCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTAC 

CTCTGCTCCAGGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCA 

CATGAACCTGCTGCTGGCCGTCTTCCTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGG 

CCCTGACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATCTTCCTGCACTTCTCCCTGCTC 

ACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTACCGACTCGTGGTGGAGGTCTT 

TGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCATCT 

TTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTGTG 

CATAGGACTCCAGAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTGGTCAG 

CTACATCACCAACCTGGGCCTCTTCAGCCTGGTGTTTCTGTTCAACATGGCCATGCTAGCCA 

CCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACACCCAAAAGTGGTCACATGTGCTGACA 

CTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTTCTCCTTTGCTTC 

TGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCATCACCTCCTTCCAAGGCTTCC 

TCATCTTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTGGCCCCTCCCCTCTGAAG 

AGCAACTCAGACAGCGCCAGGCTCCCCATCAGCTCGGGCAGCACCTCGTCCAGCCGCATC2A. 

SGCCTCCAGCCCACCTGCCCATGTGATGAAGCAGAGATGCGGCCTCGTCGCACACTGCCTGT 

GGCCCCCGAGCCAGGCCCAGCCCCAGGCCAGTCAGCCGCAGACTTTGGAAAGCCCAACGACC 

ATGGAGAGATGGGCCGTTGCCATGGTGGACGGACTCCCGGGCTGGGCTTTTGAATTGGCCTT 

GGGGACTACTCGGCTCTCACTCAGCTCCCACGGGACTCAGAAGTGCGCCGCCATGCTGCCTA 

GGGTACTGTCCCCACATCTGTCCCAACCCAGCTGGAGGCCTGGTCTCTCCTTACAACCCCTG 

GGCCCAGCCCTCATTGCTGGGGGCCAGGCCTTGGATCTTGAGGGTCTGGCACATCCTTAATC 

CTGTGCCCCTGCCTGGGACAGAAATGTGGCTCCAGTTGCTCTGTCTCTCGTGGTCACCCTGA 

GGGCACTCTGCATCCTCTGTCATTTTAACCTCAGGTGGCACCCAGGGCGAATGGGGCCCAGG 

GCAGACCTTCAGGGCCAGAGCCCTGGCGGAGGAGAGGCCCTTTGCCAGGAGCACAGCAGCAG 

CTCGCCTACCTCTGAGCCCAGGCCCCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTT 

CCCTGGGGTTCTCCTCCTCTCCCAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTCCCC 

GATTCCAATGCTGTTTTTTGGGGAGTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGT 

TTGTCTACTGCACAAGCCTCGGCCTGCCCCTGAGCCAGGCTCGGTACCGATGCGTGGGCTGG 

GCTAGGTCCCTCTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTGCTCACCCTGACC 

AAGCACACGCCTCAGAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGCAAGAACTGTGG 

ACCATGCCAGTCCCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGACTGACCTCCTCTG 

GTGACACTGGCCTAGAGCCTGACACTCTCCTAAGAGGTTCTCTCCAAGCCCCCAAATAGCTC 




WO 99/46281 



PCT/US99/05028 



FIGURE 2Q3B 



CAGGC GCCCTCGGCCGCC CAT CAT GG T TAAT T C T GTC CAAC AAACAC AC ACGG GT AGAT T G C 
T GGC C T G T T GTAGG T GG T AGGGACACAGAT GA.C C GAC CT GGT CACT C CTC CT GC CAACATTC 
AGT C T GG T AT GT GAGG C GT GC GTGAAGCAAGAACTCCT GGAGC TACAGGGACAGGGAGC CAT 
CATTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGACCTTGA 
AGAT G GG AAG GAT GTTCTTTT TAC GTAC CAAT TCTTTTGTCTTTT GATAT TAAAAAGAAGT A 
CAT GT T CAT T G T AG AG AAT T T GGAAAC T GT AGAAGAGAAT C AAGAAGAAAAAT AAAAAT CAG 
C T G T TGTAAT C GC C TAG CAAAAAAAAAAAAAAAAAAA?^^ 
AAAAAMAJU\AAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 204 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA50921 
<subunit 1 of 1, 693 aa,, 1 stop 
<MW: 77738, pi: 3.87, NX(S/T): 7 

MTPQSLLQTTLFLLSLLFLVQGAHGRGKREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSE 
EALTVHAPFPAAHPASRSFFDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQH 
QEESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELKRDLQLL 
SQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVS FEEDRINATVWKLQPTAGLQD 

LHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGE 
I0/LGIWQNTKVANLTEPWLTFQHQLQPK1WTLQCVFWVEDPT 

TQT S C FCNHL T Y FAVLMVS S VEVDAVHKHYLSLLS YVGCWSALACLVT IAAYLCSRVPLPC 

RRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAIFLHFSLLTCLSWMGLE 

GYNLYRLWEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVDNYGPIILAVHRTPEGVIY 
PSMCWIRDSLVSYITNLGLFSLVFLF*n^^ 

LPWALI FFSFASGTFQLWLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLP 
ISSGSTSSSRI 

Important features: 
Signal peptide: 

amino acids 1-25 

Putative transmembrane domains : 

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 

Microbodies C-terminal targeting signal . 

amino acids 691-693 

oAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 198-201 and 370-373 

N-glycosylation sites. 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327 
and 341-344 

G-protein coupled receptors family 2 proteins 
amino acids 47 5-504 , 
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FIGURE 205 



TGCCTGGCCTGCCTTGTCAACAATGCCGCTTACTCTGCTTCCAGGTTGCCCTGCCTTGCAGA 
GGAAANCNTCGGGACTACACCNTCAAGTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTG 
GACACGAGCTTCCTGCTCAGCGNAGCCGGTGGCCCTGACAGGCTCTGAAGGCTGGCTGCCGA 
GCCAGTGCCATCTTCCTGCACTTCTCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGG 
TACAACCTCTACCGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAA 
GCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATG 
TGGACAA.CTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAGGGCGTCATCTACCCT 
TCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCT 
GGTGTTTCTGTTCAACATGG 
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FI GURE 206 



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTTCAGGTCCA 
GGTTTTGCTTT GAT C C T TTT CAAAAAC TGGAGACACAGAAGAGCGC TC TAGGAAAAAG T TTT 
GGATGGGATTATGTGGAAACTACCCTGCGATTCTCTGCTGCCAGAGCAGGCTCGGCGCTTCC 
ACCCCAGTGCAGCCTTCCCCTGGCGGTGGTGAAAGAGACTCGGGAGTCGCTGCTTCCAAAGT 
GCCCGCCGTGAGTGAGCTCTCACCCCAGTCAGCCAA^ISAGCCTCTTCGGGCTTCTCCTGCT 
GACATCTGCCCTGGCCGGCCAGAGACAGGGGACTCAGGCGGAATCCAACCTGAGTAGTAAAT 
T C C AG TTTTC C AGCAACAAGGAACAGAAC GGAGTACAAGAT C CTCAGCAT GAGAGAAT TATT 
ACT GT G T C TAG T AAT GGAAG TAT T C ACAGC C CAAGGT T T C CT CAT AC T TAT C C AAGAAATAC 
GGTCTTGGTATG GAG AT TAG T AGCAGT AGAGG AAAAT G T AT GG AT AC AAC T T AC G T T T GAT G 
AAAG AT TTGGGCTT GAAG AC C C AGAAGAT G AC ATAT G C AAG TAT GAT T T T GT AGAAG T T GAG 
GAAC C C AG T GAT G G AAC TAT AT TAG GGC GC TGGTGTGGTTCTGG T AC T G TAC C AGGAAAACA 
GAT T T C T AAAG GAAA T C AAAT TAGGATAAGAT T T G TAT C T GAT GAATAT TTTCCTTCT GAAC 
C AGG G T T C T G CAT C C AC T ACAAC AT T G T CAT G C C ACAAT T C ACAGAAGC T GT GAG T C C T T CA 
GTGCTACCCCCTTCAGCTTTGCCACTGGACCTGCTTAATAATGCTATAACTGCCTTTAGTAC 
C T T GGAAGAC C T TAT T C GAT AT C T T GAAC CAGAGAGAT GGCAGT T GGAC T T AGAAGAT C TAT 
ATAGGCCAACTTGGCAACTTCTTGGCAAGGCTTTTGTTTTTG.GAAGAAAATCCAGAGTGGTG 
GAT C T GAAC C T T C T AAC AG AG GAG G T AAG AT TAT AC AG C T G C AC AC C T C G T AAC T T C T C AG T 
GTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTA 
AACGCTGTGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCA 
AGCAAAG T TAC T AAAAAAT ACCAC GAGG T CCTT CAGTTGAGACCAAAGACC GGTG T CAGGGG 
ATT GC ACAAAT CAC TC AC C GAC GT GGCCC TGGAGCAC CAT GAGGAGT GT GAC T GTGT G T GCA 
G AGGG AG C AC AG GAG GATA£C C GC AT C AC CAC C AG C AG CTCTTGCC C AGAG C T GT GCAG T G C 
AGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCTT 
C AAGG AC C T T T CAT C T T C AGGAT T TAC AG T GC AT T C T GAAAGAGG AGAC AT C AAAC AGAAT T 
AGGAG T T G T GC AAC AG C T C T T T TGAGAG GAGGC C TAAAGGACAGGAGAAAAGG T C T T C AAT C 
G TGGAAAGAAAAT T AAAT GT T G TAT TAAATAGAT CAC C AGC TAGT T T CAGAGT T AC C AT GTA 
CGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTCGATACGGCTTAGGGTAATGTCAGT 
ACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAACTCTAAAGCTCC 
ATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCTCATATTCAC 
ATAT GT AAAC C AG AAC AT T C T AT GTAC T ACAAAC C T GGT T T T TAAAAAGGAAC TAT G T T GCT 
AT GAAT T AAAC TTGTGT CAT GCT GAT AGGAC AGACT GGATTTTT C AT AT TT CT T AT T AAAAT 
T T C T G C CAT T T AG AAGAAG AGAAC TAC AT T CAT GGT T T GGAAGAGATAAAC C T GAAAAGAAG 
AGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTAT 
ATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCTATTTTTAC 
CAAAG G TAT T T AATAT T C T T T T T TAT GACAAC T T AGAT CAAC TAT TTT T AGCT T GG TAAAT T 
T TT C T AAAC ACAAT T G T T AT AGC CAGAG GAACAAAGAT GATATAAAATATT GTTGCTCT GAC 
AAAAAT AC AT GT AT T T CAT T CT CGT AT GGT GCT AGAGT T AGATTAAT C T GCAT T T T AAAAAA 
C T GAAT T G GAAT AGAAT T GG T AAGT T GCAAAG AC TTT T TGAAAAT AAT TAAAT TAT CAT AT C 
T T C CAT T C C T G T TAT T GG AG AT GAAAATAAAAAGCAAC T TAT GAAAGT AGACATT CAGATC C 
AGC CAT TAC TAAC C T AT T C C TT T T T T GG GGAAAT C T GAGC CT AGC T CAGAAAAACATAAAG C 
ACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACAT 
CCTATTTATTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACTTGTATAAAT 
AC AT GGAT AT TTT TAT GTAC AGAAGTATGT C T C T TAAC C AGT T CAC T TAT T GTAC T C T G GCA 
AT T TAAAAG AAAAT C AG T AAAAT ATT T T G CT T GTAAAAT GCT TAAT ATNGT G C CT AGGT TAT 
GTGGTGACTATTTGAATCAAAAATGTATTGAATCATCAAATAAAAGAATGTGGCTATTTTGG 
GGAGAAAAT TAAAAAAAAAAAAAAAAAAAAAG G T T TAG GGATAACAGGG TAAT G C GGC C 
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FIGURE 207 

MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERIITVSTNGSIHSPR 
FPHTYPRNTVLWRLVAVEENVWIQLTFDERFGLEDPEDDICKYLFVSVEEPSDGTILGRWC 
GSGTVPGKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLL 
NNAITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRV^/DLNLLTESVRLY 
3 C T PRN F S VS I RE E LKRT DT I FW P GCLL VKRCGGNCACCLHNCNE C QC VP SKVT KKYHEVLQ 
LRPKT GVRGLHKS L T DVALEHKEEC DC VCRGS T GG 
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FIGURE 208 

C CC AT C T CAAGC T GAT C T T GGCAC C T C T CAT GC TC TGCT C T CT T CAAC CAGACC T CTACAT T 
C CAT T T T GGAAGAAGAC T AAAAATSGT GTT TC CAAT GTGGACACTGAAGAGACAAAT TT A 

TCCTTTTTAACATAATCCTAATTTCCAAACTCCTTGGGGCTAGATGGTTTCCTAAAACTCTG 
C C C T GT GAT G T C AC T C T G GATGTT CCAAAGAACCATGT GATC GT GGAC T GCACAGAGAA GCA 
T T T G AC AGAAAT T C C T GG AGGTAT T C C C AC GAAC AC C AC GAAC C T CAC C C T C AC CAT TA£ C C 
AC AT AC C AG AC AT C T C C C C AGC G T C CT T T C ACAGAC T G GAC CAT C T GG T AGAGAT C GAT T 1 T C 
AG AT G CAAC T G T G T AC C TAT T C CAC T G GGGT CAAAAAAC AACAT G T GC AT CAAGAGGC T GC A 
GAT T AAAC C C AGAAG C T T T AGT GGAC T CAC T TAT T TAAAAT C C C T T T AC C T GGAT GGAAAC C 

AGCTACTAGAGATACCGCAGGGCCTCCCGCCTAGCTTACAGCTTCTCAGCCTTGAGGCCAAC 
AAC AT C T T T T C CAT C AG AAAAGAGAAT C T AACAGAAC T GG C C AAC AT AGAAATAC T C TAG C T 
GGG C C AAAAC T G T T AT TAT C GAAAT C C T T G T TAT G T T T C ATAT T C AATAGAGAAAGAT GCC T 

TCCTAAACTTGACAAAGTTAAAAGTGCTCTCCCTGAAAGATAACAATGTCACAGCCGTCCCT 
AC TGTTTTGC CAT C T AC T T T AAC AGAAC TAT AT C T C TAC AACAAC AT GAT T GCAAAAAT C CA 
AGAAG AT GAT T T T AAT AAC C T CAAC CAAT TACAAAT T C T T GAC C TAAGT GGAAAT T G C C C T C 

GTTGTTATAATGCCCCATTTCCTTGTGCGCCGTGTAAAAATAATTCTCCCCTACAGATCCCT 
GTAAAT GC T T T T GAT GC G CT GACAGAAT TAAAAGT T T TAC GTCTACACAGTAAC T CT CT T CA 
GC AT G T G C C C C C AAG AT G G T T TAAGAAC AT CAAC AAAC T C C AGGAAC T GGAT C T G T C C CAAA 
ACTTCTTGGCCAAAGAAATTGGGGATGCTAAATTTCTGCATTTTCTCCCCAGCCTCATCCAA 
TTGGATCTGTCTTTCAATTTTGAACTTCAGGTCTATCGTGCATCTATGAATCTATCACAAGC 
ATTTTCTTCACTGAAAAGCCTGAAAATTCTGCGGATCAGAGGATATGTCTTTAAAGAGTTGA 
AAAGCTTTAACCTCTCGCCATTACATAATCTTCAAAATCTTGAAGTTCTTGATCTTGGCACT 
AACTTTATAAAAATTGCTAACCTCAGCATGTTTAAACAATTTAAAAGACTGAAAGTCATAGA 
T CT T T CAGT GAATAAAATAT CACCTTCAGGAGATT CAAG T GAAG TTGGC T T CT GC TCAAAT G 
C C AGAAC T T C T G T AGAAAG T TAT GAAC C C C AG G T C C T GGAACAAT T ACAT TAT T T CAGAT AT 
GAT AAG TAT G CAAG GAG T T G C AGAT T C AAAAAC AAAGAGGC T T C T T T CAT G T C T GT T AAT GA 
AAG C T G C TAC AAG TAT GGG C AGAC C T T GGAT C T AAGTAAAAAT AG T ATAT T T T T T GT CAAGT 

CCTCTGATTTTCAGCATCTTTCTTTCCTCAAATGCCTGAATCTGTCAGGAAATCTCATTAGC 
C AAAC T C T T AAT G G C AG T GAAT TC CAAC C T T T AGCAGAG C T GAGAT AT T T GGAC T T C T C CAA 
CAAC C G GC T T GAT T TAC T C CAT T CAAC AGC AT T T GAAGAGC T T CAC AAAC T GGAAGT T CT GG 
ATATAAGCAGTAATAGCCATTATTTTCAATCAGAAGGAATTACTCATATGCTAAACTTTACC 
AAGAACCTAAAGGTTCTGCAGAAACTGATGATGAACGACAATGACATCTCTTCCTCCACCAG 
C AGGAC CAT GGAGAG T GAG T C T C T TAGAAC T C T GGAAT T C AGAGGAAAT CAC T T AGAT G TT T 
TAT G GAG AGAAG G T GAT AAC AG AT AC T TACAAT TAT T CAAGAAT C T GC TAAAAT T AGAGGAA 
TTAGACATCTCTAAAAATTCCCTAAGTTTCTTGCCTTCTGGAGTTTTTGATGGTATGCCTCC 
AAATCTAAAGAATCTCTCTTTGGCCAAAAATGGGCTCAAATCTTTCAGTTGGAAGAAACTCC 
AGTGTCTAAAGAACCTGGAAACTTTGGACCTCAGCCACAACCAACTGACCACTGTCCCTGAG 
AGAT TAT C CAAC T G T T C CAGAAGC C T CAAGAAT C T GAT T C T TAAGAATAAT C AAAT CAGGAG 

TCTGACGAAGTATTTTCTACAAGATGCCTTCCAGTTGCGATATCTGGATCTCAGCTCAAATA 
AAAT C CAGAT GAT C C AAAAG AC CAGC T T C C CAGAAAAT G T C C T CAACAAT C T GAAGAT GTT G 

CTTTTGCATCATAATCGGTTTCTGTGCACCTGTGATGCTGTGTGGTTTGTCTGGTGGGTTAA 
CCATACGGAGGTGACTATTCCTTACCTGGCCACAGATGTGACTTGTGTGGGGCCAGGAGCAC 
ACAAGGGCCAAAGTGTGATCTCCCTGGATCTGTACACCTGTGAGTTAGATCTGACTAACCTG 
ATTCTGTTCTCACTTTCCATATCTGTATCTCTCTTTCTCATGGTGATGATGACAGCAAGTCA 
CCTCTATTTCTGGGATGTGTGGTATATTTACCATTTCTGTAAGGCCAAGATAAAGGGGTATC 
AGCGTCTAATATCACCAGACTGTTGCTATGATGCTTTTATTGTGTATGACACTAAAGACCCA 
G C T G T GAC C GAG TGGGTTTTGGCT G AGC T G G T G G C CAAAC T G GAAG AC C C AAG AGAG AAAC A 

TTTTAATTTATGTCTCGAGGAAAGGGACTGGTTACCAGGGCAGCCAGTTCTGGAAAACCTTT 
C C C AG AGC AT AC AG C T T AGC AAAAAGACAG T G T T T G T GAT G ACAGACAAGTAT GCAAAG AC T 
GAAAAT T T T AAGAT AG CAT T T TAC T TGT C C CAT C AGAGG C T CAT GGAT GAAAAAG T T GAT G T 
GATTATCTTGATATTTCTTGAGAAGCCCTTTCAGAAGTCCAAGTTCCTCCAGCTCCGGAAAA 
GGCTCTGTGGGAGTTCTGTCCTTGAGTGGCCAACAAACCCGCAAGCTCACCCATACTTCTGG 
CAGTGTCTAAAGAACGCCCTGGCCACAGACAATCATGTGGCCTATAGTCAGGTGTTCAAGGA 
AACGGTCmSCCCTTCTTTGCAAAACACAACTGCCTAGTTTACCAAGGAGAGGCCTGGC 
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FIGURE 209 

MVFPMWTLKRQILILFNIILISKLLGARWFPKTLPCDVTLDVPKNHVIVDCTDfGJLTEIPGG 
IPTNTTNLTLTINHIPDISPASFHRLDHLVEIDFRCNCVPIPLGSKNNMCIKRLQIKPRSFS 
GLTYLKSLYLDGNQLLEIPQGLPPSLQLLSLEANNIFSIRKENLTELANIEILYLGQNCYYR 
NPCYVSYSIEKDAFLNLTKLKVLSLKDNNVTAVPTVLPSTLTELYIYNNMIAKIQEDDFNNL 
NQLQILDLSGNCPRCYNAPFPCAPCKNNSPLQIPWAFDALTSLKVLRLHSNSLQIT/PPRWF 
KNINKLQELDLSQNFLAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSL 
KILRIRGYVFKELKS FNLSPLHNLQNLEVLDLGTNFIKIANLSMFKQFKRLKVIDLSVNKIS 
PSGDSSEVGFCSNARTSVESYEPQVLEQLHYFRYDKYARSCRFKNKEASFMSVNESCYKYGQ 
TLDLSKNS I FFVKS S DFQHLS FLKCLNLSGNLI SQTLNGSEFQPLAELRYLDFSNNRLDLLH 
S TAFEE LHKLE VL D I S S NS H YFQS EG I THMLNFTKNLKVL QKLMMNDND ISSSTS RTMESES 

LRTLE FRGNHLDVLWREGDNRYLQLFKNLLKLEELDI SKNSLS FLPSGVFDGMPPNLKNLSL 
AKNGLKSFSWKKLQCLKNLETLDLSHNQLTTVPERLSNCSRSLKNLILKNNQIRSLTKYFLQ 
DAFQLRYLDLS SNKI QM I QKT S FPENVLNNLKMLLLHHITOFLCTCDAAmFVWWVNHTEVT I P 
YLATDVTCVGPGAHKGQSVISLDLYTCELDLTNLILFSLSISVSLFLMVMMTASHLYFWDVW 
YIYHFCKAKIKGYQRLISPDCCYDAFIVYDTKDPAVTEWVLAELVAKLEDPREKHFNLCLEE 
RDWLPGQPVLENLSQS I QLSKKTVFVMTDKYAKTENFKIAFYLSHQRLMDEKVDVI IL I FLE 
KPFQKSKFLQLRKRLCGSSVLEWPTNPQAHPYFWQCLKNALATDNHVAYSQVFKETV 
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FIGURE 210A 



G G G TAG CAT TCTGCGCTGCT GC AAG T TAG GGAAT GAAAAAT T AGAACAAC AGAAACATQGAA 

AACATGTTCCTTCAGTCGTCAATGCTGACCTGCATTTTCCTGCTAATATCTGGTTCCTGTG^ 
G T TAT G C G C C GAAGAAAAT T T T T C TAGAAGC TAT C C T T G T GAT G AGAAAAAG CAAAAT GAC T 
C AG T TAT T G C AGAG T G C AG CAAT C GT C GAC TACAGGAAG T T C C C CAAAC GGT GGGCAAAT AT 
G T GAC AGAAC T AGA.C C T G T C T GAT AAT T T CAT C AC AC AC AT AAC GAAT GAAT GAT T T C AAG G 

GCTGCAAAATCTCACTAAAATAAATCTAAACCACAACCCCAATGTACAGCACCAGAACGGAA 
AT C C C G G TAT AC AAT C AAAT G G C T T GAATAT CACAGAC GGGGC AT T C C T CAAC C TAAAAAAC 
C T AAG G GAG T T AC T G C T T GAAGACAAC C AGT TAC C C C AAAT AC CCTCTGGTTTGC CAGAG T C 
T T T GAC AG AAC T TAG T C T AAT T C AAAAC AAT AT AT ACAACAT AAC TAAAGAGGGCAT T T C AA 
GAC T T AT AAAC T T GAAAAAT C T C TAT T T GGC C T GGAAC T GC TAT T T TAAC AAAGT T T GC GAG 
AAAAC T AAC AT AGAAG AT G G AGT AT T T GAAAC GC T GACAAAT T T GGAG T T GC TAT C AC TAT C 

TTTCAATTCTCTTTCACACGTGCCACCCAAACTGCCAAGCTCCCTACGCAAACTTTTTCTGA 
G CAAC AC C CAGAT CAAAT AC AT TAG T GAAGAAGAT T T C AAGGG AT T GAT AAAT T TAACAT TA 

CTAGATTTAAGCGGGAACTGTCCGAGGTGCTTCAATGCCCCATTTCCATGCGTGCCTTGTGA 
TGGTGGTGCTTCAATTAATATAGATCGTTTTGCTTTTCAAAACTTGACCCAACTTCGATACC 
TAAACCTCTCTAGCACTTCCCTCAGGAAGATTAATGCTGCCTGGTTTAAAAATATGCCTCAT 
CTGAAGGTGCTGGATCTTGAATTCAACTATTTAGTGGGAGAAATAGTCTCTGGGGCATTTTT 
AAC GAT GCTGCCCCGCT TAG AAAT ACT T GAC T TGTC T T T TAAC TATATAAAGGGGAGT TATC 
CAC AG CAT AT T AAT AT T T C C AGAAAC T T C T C T AAAC TTTTGTCTC TAC GGGCAT T GC AT T T A 

AGAGGTTATGTGTTCCAGGAACTCAGAGAAGATGATTTCCAGCCCCTGATGCAGCTTCCAAA 
C T TAT C GAC T A T CAAC T T G GGTAT TAATT T TAT TAAGC AAAT C GATT T CAAAC T T T T C CAAA 

ATTTCTCCAATCTGGAAATTATTTACTTGTCAGAAAACAGAATATCACCGTTGGTAAAAGAT 
ACCCGGCAGAGTTATGCAAATAGTTCCTCTTTTCAACGTCATATCCGGAAACGACGCTCAAC 
AGAT T T T GAG T T T GAC C CAC AT TCGAACTT T TAT CAT T T CACC C GT CC TT TAATAAAGC CAC 

AATGTGCTGCTTATGGAAAAGCCTTAGATTTAAGCCTCAACAGTATTTTCTTCATTGGGCCA 
AACCAATTTGAAAATCTTCCTGACATTGCCTGTTTAAATCTGTCTGCAAATAGCAATGCTCA 
AGTGTTAAGTGGAACTGAATTTTCAGCCATTCCTCATGTCAAATATTTGGATTTGACAAACA 
ATAGACTAGACTTTGATAATGCTAGTGCTCTTACTGAATTGTCCGACTTGGAAGTTCTAGAT 
C T C AGC TAT AAT T CAC AC TAT T T CAGAATAGC AGGC G TAACAC AT CAT C TAGAAT T TAT T C A 
AAAT T T CAC AAAT C T AAAAG T T T T AAAC T T GAGC CACAACAAC AT T TATAC T T TAACAGATA 
AGTATAACCTGGAAAGCAAGTCCCTGGTAGAATTAGTTTTCAGTGGCAATCGCCTTGACATT 
TTGTGGAATGATGATGACAACAGGTATATCTCCATTTTCAAAGGTCTCAAGAATCTGACACG 
TCTGGATTTATCCCTTAATAGGCTGAAGCACATCCCAAATGAAGCATTCCTTAATTTGCCAG 
CGAGTCTCACTGAACTACATATAAATGATAATATGTTAAAGTTTTTTAACTGGACATTACTC 
CAGCAG TTTCCTCGTCTC GAGT TGCT TGACT TAC GT GGAAACAAACTAC TC T TTT TAAC TGA 
TAGC C TAT C T GAC T T T ACAT C T TCC CT TCGGACAC T GC T G CT GAGT CATAACAGGATT TCC C 
ACCTACCCTCTGGCTTTCTTTCTGAAGTCAGTAGTCTGAAGCACCTCGATTTAAGTTCCAAT 
C T G C TAAAAAC AAT CAAC AAAT C C GCAC T T GAAAC T AAG AC CAC CAC CAAAT TAT C TAT G T T 
G GAAC TAC AC G GA&AC C C C T T T GAAT GC AC C T G T GAC AT T GGAG AT T T C C GAAGAT GGAT GG 
ATGAACATCTGAATGTCAAAATTCCCAGACTGGTAGATGTCATTTGTGCCAGTCCTGGGGAT 
CAAAGAGGGAAGAGTATTGTGAGTCTGGAGCTAACAACTTGTGTTTCAGATGTCACTGCAGT 
, GATATTATTTTTCTTCACGTTCTTTATCACCACCATGGTTATGTTGGCTGCCCTGGCTCACC 
ATTTGTTTTACTGGGATGTTTGGTTTATATATAATGTGTGTTTAGCTAAGGTAAAAGGCTAC 
AGG T C T C T T T C CAC AT C C CAAAC T T T C TAT GAT GC T T ACAT T T C T TAT GACACCAAAGATG C 

CTCTGTTACTGACTGGGTGATAAATGAGCTGCGCTACCACCTTGAAGAGAGCCGAGACAAAA 
ACGTTCTCCTTTGTCTAGAGGAGAGGGATTGGGACCCGGGATTGGCCATCATCGACAACCTC 
AT GCAG AG CAT CAAC C AAAG CAAGAAAAC AG TAT T T GT T T TAAC C AAAAAAT AT GCAAAAAG 
C T GGAAC T T T AAAAC AG C T T T T TAC TTGGCTTTG C AGAGG C TAAT GGAT GAGAACAT GGAT G 
T GAT T ATAT T TAT C C T G C T G GAGC CAG T GT T AC AGCAT T C T C AG TAT T T GAG GC TAC GGC AG 
CGGATCTGTAAGAGCTCCATCCTCCAGTGGCCTGACAACCCGAAGGCAGAAGGCTTGTTTTG 
GCAAACTCTGAGAAATGTGGTCTTGACTGAAAATGATTCACGGTATAACAATATGTATGTCG 
AT T C CAT T AAG CAATAC^SAC T G AC GT TAAGT CAT GAT T T C G C G C CATAAT AAAG AT G CAAA 

GGAATGACATTTCTGTATTAGTTATCTATTGCTATGTAACAAATTATCCCAAAACTTAGTGG 
TT TAAAAC AACACAT TT GC T GG C C CAC AGT T TTT GAGGG T CAG GAGT C CAGGCC C AGC ATAA 
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FIGURE 21QB 

CTGGGTCCTCTGCTCAGGGTGTCTCAGAGGCTGCAATGTAGGTGTTCACCAGAGACATAGGC 
ATCACTGGGGTCACACTCATGTGGTTGTTTTCTGGATTCAATTCCTCCTGGGCTATTGGCCA 
AAGGCTATACTCATGTAAGCCATGCGAGCCTCTCCCACAAGGCAGCTTGCTTCATCAGAGCT 
AGCAAAAAAGAGAGGTTGCTAGCAAGATGAAGTCACAATCTTTTGTAATCGAATCAAAAA^G 
TGATATCTCATCACTTTGGCCATATTCTATTTGTTAGAAGTAAACCACAGGTCCCACCAGCT 
CCATGGGAGTGACCACCTCAGTCCAGGGAAAACAGCTGAAGACCAAGATGGTGAGCTCTGAT 
TGCTTCAGTTGGTCATCAACTATTTTCCCTTGACTGCTGTCCTGGGATGGCCTGCTATCTTG 
ATGATAGATTGTGAATATCAGGAGGCAGGGATCACTGTGGACCATCTTAGCAGTTGACCTAA 
CACATCTTCTTTTCAATATCTAAGAACTTTTGCCACTGTGACTAATGGTCCTAATATTAAGC 
TGTTGTTTATATTTATCATATATCTATGGCTACATGGTTATATTATGCTGTGGTTGCGTTCG 
GTTTTATTTACAGTTGCTTTTACAAATATTTGCTGTAACATTTGACTTCTAAGGTTTAGATG 
CCATTTAAGAACTGAGATGGATAGCTTTTAAAGCATCTTTTACTTCTTACCATTTTTTAAAA 

GTATGCAGCTAAATTCGAAGCTTTTGGTCTATATTGTTAATTGCCATTGCTGTAAATCTTAA 
AATGAATGAATAAAAATGTTTCATTTTACAAAAAAAAAAAAAAAA 
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FIG UR E 211 



MENMFLQSSMLTCIFLLISGSCELCAEENFSRSYPCDEKKQNDSVIAECSNRRLQEVPQTVG 
KYVTELD L S DN FI TH I TNE S FQGLQNL TKINLNHNPNVQHQNGNF G I Q SNGLNI TBGAFLNL 
KNLRELLLEDNQLPQIPSGLPESLTELSLIQNNIYNI^ 

CEKTNIEDGVFETLTNLELLSLSFNSLSHVPPKLPSSLRKLFLSNTQIKYISEEDFKGLINL 
TLLDLSGNCPRCFNAPFPCVPCDGGASINIDRFAFQNLTQLRYLNLSSTSLRKINAAWFKNM 
PHLKVLDLE FNYLVGE I VSGAFLTMLPRLE ILDLS FNY I KG SYPQH IN I SRNFSKLLSLRAL 
HLRGYVFQELREDDFQPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLV 
KDTRQSYAISISSSFQRHIRKRRSTDFEFDPHSNFYHFTRPLIKPQCAAYGKALDLSLNSIFFI 
GPNQFENLPDIACLNLSANSNAQVLSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEV 
LDLSYNSHYFRIAGVTHHLEFIQNFTNLKVLNLSKNNIYTLTDKYNLESKSLVELVFSGNRL 
D I LWNDDDNRY I S I FKGLKNLTRLDLSLNRLKHI PNEAFLNLPASLTELHINDNMLKFFNWT 
LLQQFPRLELLDLRGNKLLFLTDSLSDFTSSLRTLLLSHNRISHLPSGFLSEVSSLKHLDLS 
SNLLKTINKSALETKTTTKLSMLELHGNPFECTCDIGDFRRWMDEHLNVKIPRLVDVICASP 
GDQRGKSIVSLELTTCVSDVTAVILFFFTFFITT^ 

GYRS L S T S QT FY DAY I S YDTKDAS VT DWVINELRYHLEE SRDKNVLLCLEERDWDPGLAI I D 
NLMQSINQSKKTVFVLTKKYAKS 

RQRI CKS S I LQWPDNPKAEGLFWQTLRNWLTENDSRYNNMYVDS IKQY 
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FIGURE 212 



CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 
CGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 
CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 
AAGGGC T AGG G T C CAT C T C C AGTC C CAGGACACAGCAGC GGC CAC CAT GGC C ACGC C T GGGC 
TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGA 
GCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAA 
GGCCACCCCGCCTGGAGGCACAGGCC^TGAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTT 
CTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGT 
CCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCA 
CCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGC 
CGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGAC 
CAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGA 
GCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCA 
GATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGCCGG 
CAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGC 
C CAAG GG AG GG C C C C C C AG GGTGGCCCC C AAC C C GAC AGGAG TGGAC AGT GCAAT GAAGGAA 
GAAG T GC AGAG GC T GC AG T C C AGG GT GGAC C T G C T GGAGGAGAAGC T GC AGC TGGTGCTGGC 
CCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCC 
TGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTG 
GAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGEsACTGCCCAGCGCCCCAGGCTG 
GACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAG 
AAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCC 
TTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCAC 
CCCTGGCTACCCCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTG 
AGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAG 
GCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTGA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAPA^ 

CGACCTGCAGAAGC T TGGCCGCCATGGCCCAACTTGT TTAT T GCAGCT TATAATGGT TACAAAT 
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FI GURE 2 13 



MRGS QEVLLMWLLVLAVGGTEHA.YRPGRRVCAVRAHGDPVSES FVQRVYQ? FLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPFCRNGG3CVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPT G VDS AHKEE VQRLQS R VDLLEEKLQL^/LAPLHS LAS QALSHGL PDPGS LLVH5 FQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS ■ 
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FI GURE 21 4 



GCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGCGCCTGAGGCCCCAGCAAG 

GGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCC 
AG C AG CAT C AG AGC AGC CCCTGTGGTTGG CAGCAAAGT T CAGC T T GGC T GGG C C C GC T GT GA 

GGGGCTTCGCGCTACGCCCTGCGGTGTCCCGAGGGCTGAGGTCTCCTCATCTTCTCCCTAGC 
AGTGGATGAGCAACCCAACGGGGGCCCGGGGAGGGGAACTGGCCCCGAGGGAGAGGAACCCC 
AAAGC CACAT CT GTAGC CAGGATGAGCAGTGTGAATC CAGGCAGCCCC CAGGACCGGGGAGG 
CACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTC 
TCCTCCGC C AGG C C AC C C AG AGGAGAAG G C C AC C C C GC C T GGAGGC AC AGGC CATSAGGGGC 
TCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTA 
CCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCG 
TGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTAC 
CGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTA 
CGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATAT 
GCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCA 
GGAT GGCGGGGT GAC AC T T G C CAG T CAGAT GTGGAT GAAT GC AG T GC TAGGAGGGGC GGC T G 
TCCCCAGCGCTGCATCAACACCGCCGGCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCC 
TGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCG 
ACAGGAG T GGAC AG T GCAAT GAAGGAAGAAGTGCAGAGGCTGCAGTCCAGGGT GGACCT GC T 
GGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGC 
ATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGAC 
TCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGA 
CTCGTSACTGCCCAGCGCTCCAGGCTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATG 
CCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGC 
AGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGAT 
GGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACCCCCACCCTGGCTACCCCAACGGCA 
T C C C AAG G C CAG G T GGAC C C T CAGC T GAGGGAAGGTAC GAGC T C C CT G CT GGAGC C T GGGAC 
CCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGAC 
CCCCAGCACAATAAAAATGAAACGTG 
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FIGURE 215 



MRGSQEVLLMWLLVLAVGGTSH^YRFGRRVCAVRAHGDPVSESr/QRVYQPFLTTCDGHRAC 
S T YRT I YRTAYRRS PGLAPARPRYACCPGWKRTSGLPGACGAAI CQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 

PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 
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FIGURE 216 



CCCACGCGTCCGAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGACAGGCCAGGCA 
GGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAGGGCTAGGG 
T C CAT C T C C AG T C C C AG GAC ACAGCAGC G G C C AC CAT GGC CAC GCCTGGGCTC C AGC AGC AT 

CAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGC 
CCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGC 
CTGGAGGCACAGGCCAISAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGC 
AGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACG 
GGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGAC 
GGGCACCGGGCCT GCAGCACCTACCGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGG 
GCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTC 
CTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCAT.GCCGGAACGGAGGGAGCTGTGTCCAG 
CCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGA 
ATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCGTCAACACCGCCGGCAGTTACTGGT 
GCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGG 
CCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAG 
GCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACA 
GCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCC 
TT C CAG CAGC T C G GC CGCAT C GAC TC CC TGAGC GAGCAGATTT C CT TC C T GGAGGAGCAGCT 

GGGGTCCTGCTCCTGCAAGAAAGACTCGTQ^CTGCCCAGCGCCCCAGGCTGGACTGAGCCCC 
TCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCG 
GGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAG 
GCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACC 
C C CAC C C T G G C T AC C C C AAC GG C AT C C CAAGGC CAGGT GGGC C C T CAGC T GAGGGAAGG TAC 

GAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGG 
C C T CAG TGGGGGCTGCT GC C T GAC C CCC AGC ACAAT AAAAAT GAAAC GT G 
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FIGURE 217 



MRG S QE VLLMW L L VL AVGG T E HAYR PGRRV CAVRAHGD P V S E S FV QR VYQ P FL T T C BGHRAC 
S TYRTI YRTAYRRS PGLAPARPRYACCPGWKRTSGLPGACGAAI CQPPCRNGG3CVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCTOTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVKSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 
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FIGURE 218 



GGTTGCCACAGCTGGTTTAGGGCCCCGACCACTGGGGCCCCTTGTCAGGAGGAGACAGCCTC 
CCGGCCCGGGGAGGACAAGTCGCTGCCACCTTTGGCTGCCGACGTGATTCCCTGGGACGGTC 
CGTTTCCTGCCGTCAGCTGCCGGCCGAGTTGGGTCTCCGTGTTTCAGGCCGGCTCCCCCTC 
CTGGTCTCCCTTCTCCCGCTGGGCCGGTTTATCGGGAGGAGATTGTCTTCCAGGGCTAGCA? 
TTGGACTTTTGATGATGTTTGACCCAGCGGCAGGAATAGCAGGCAACGTGATTTCAAAGCTG 
GGCTCAGCCTCTGTTTCTTCTCTCGTGTAATCGCAAAACCCATTTTGGAGCAGGAATTCCAA 
TCATSTCTGTGATGGTGGTGAGAAAGAAGG'TGACACGGAAATGGGAGAAACTCCCAGGCAGG 
AACACCTTTTGCTGTGATGGCCGCGTCATGATGGCCCGGCAAAAGGGCATTTTCTACC T GAC 
CCTTTTCCTCATCCTGGGGACATGTACACTCTTCTTCGCCTTTGAGTGCCGCTACCTGGCTG 
TTCAGCTGTCTCCTGCCATCCCTGTATTTGCTGCCATGCTCTTCCTTTTCTCCATGGCTACA 
CTGTTGAGGACCAGCTTCAGTGACCCTGGAGTGATTCCTCGGGCGCTACCAGATGAAGCAGC 
TTTCATAGAAATGGAGATAGAAGCTACCAATGGTGCGGTGCCCCAGGGCCAGCGACCACCGC 
CTCGTATCAAGAATTTCCAGATAAACAACCAGATTGTGAAACTGAAATACTGTTACACATGC 
AAGATCTTCCGGCCTCCCCGGGCCTCCCATTGCAGCATCTGTGACAACTGTGTGGAGCGCTT 
CGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTACCGCTACTTCTACC 

TCTTCATCCTTTCTCTCTCCCTCCTCACAATCTATGTCTTCGCCTTCAACATCGTCTATGTG 
GCCCTCAAATCTTTGAAAATTGGCTTCTTGGAGACATTGAAAGAAACTCCTGGAACTGTTCT 

AGAAGTCCTCATTTGCTTCTTTACACTCTGGTCCGTCGTGGGACTGACTGGATTTCATACTT 
TCCTCGTGGCTCT C AAC C AG ACAAC C AAT GAAGAC AT CAAAGGAT C AT GGAC AGGGAAGAAT 

CGCGTCCAGAATCCCTACAGCCATGGCAATATTGTGAAGAACTGCTGTGAAGTGCTGTGTGG 

CCCCTTGCCCCCCAGTGTGCTGGATCGAAGGGGTATTTTGCCACTGGAGGAAAGTGGAAGTC 

GACCTCCCAGTACTCAAGAGACCAGTAGCAGCCTCTTGCCACAGAGCCCAGCCCCCACAGAA 

CACCTGAACTCAAATGAGATGCCGGAGGACAGCAGCACTCCCGAAGAGATGCCACCTCCAGA 

GCCCCCAGAGCCACCACAGGAGGCAGCTGAAGCTGAGAAG3&GCCTATCTATGGAAGAGACT 

TTTGTTTGTGTTTAATTAGGGCTATGAGAGATTTCAGGTGAGAAGTTAAACCTGAGACAGAG 

AGCAAGTAAGCTGTCCCTTTTAACTGTTTTTCTTTGGTCTTTAGTCACCCAGTTGCACACTG 

GCATTTTCTTGCTGCAAGCTTTTTTAAATTTCTGAACTCAAGGCAGTGGCAGAAGATGTCAG 

TCACCTCTGATAACTGGAAAAATGGGTCTCTTGGGCCCTGGCACTGGTTCTCCATGGCCTCA 

GCCACAGGGTCCCCTTGGACCCCCTCTCTTCCCTCCAGATCCCAGCCCTCCTGCTTGGGGTC 

ACTGGTCTCATTCTGGGGCTAAAAGTTTTTGAGACTGGCTCAAATCCTCCCAAGCTGCTGCA 

CGTGCTGAGTCCAGAGGCAGTCACAGAGACCTCTGGCCAGGGGATCCTAACTGGGTTCTTGG 

GGTCTTCAGGACTGAAGAGGAGGGAGAGTGGGGTCAGAAGATTCTCCTGGCCACCAAGTGCC 

AGCATTGCCCACAAATCCTTTTAGGAATGGGACAGGTACCTTCCACTTGTTGTANNNNNNNN 

NNNNNNNNNNNNNNNNNNNTTGTTTTTCCTTTTGACTCCTGCTCCCATTAGGAGCAGGAATG 

GCAGTAATAAAAGTCTGCACTTTGGTCATTTCTTTTCCTCAGAGGAAGCCCGAGTGCTCACT 

TAAACACTATCCCCTCAGACTCCCTGTGTGAGGCCTGCAGAGGCCCTGAATGCACAAATGGG 

AAACCAAGGCACAGAGAGGCTCTCCTCTCCTCTCCTCTCCCCCGATGTACCCTCAAAAAAAA 

AAAAATGCTAACCAGTTCTTCCATTAAGCCTCGGCTGAGTGAGGGAAAGCCCAGCACTGCTG 

CCCTCTCGGGTAACTCACCCTAAGGCCTCGGCCCACCTCTGGCTATGGTAACCACACTGGGG 

GCTTCCTCCAAGCCCCGCTCTTCCAGCACTTCCACCGGCAGAGTCCCAGAGCCACTTCACCC 

TGGGGGTGGGCTGTGGCCCCCAGTCAGCTCTGCTCAGGACCTGCTCTATTTCAGGGAAGAAG 

ATTTATGTATTATATGTGGCTATATTTCCTAGAGCACCTGTGTTTTCCTCTTTCTAAGCCAG 

GGTCCTGTCTGGATGACTTATGCGGTGGGGGAGTGTAAACCGGAACTTTTCATCTATTTGAA 
GGCGATTAAACTGTGTCTAATGCA 
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FIGURE 219 

MS VMWRKKVTRKWE KL 2 GKNT FC CDGRVMMARQKG I FYL XL FL I LG T C TL FFAFECRYLAV 
QLSPAIPVFAAMLFLFSMATLLRTSFSDPGVIPRALPDEAAFIEMEIEATNGAVPQGQRPP? 
RIKNFQ INNQI VKLKYCYTCKI FRPPRASHCS I CDNCVERFDHHCPWVGNCVGKRNYRYFYL 

FILSLSLLTIYVFAFNIVYVALKSLKIGFLETLKETPGTVLEVLICFFTLWSWGLTGFHTF 

LVALNQTTNEDIKGSWTGKNRVQNPYSHGNIVKNCCEVLCGPLPPSVLDRRGILPLEESGSR 
PPSTQETSSSLLPQSPAPTEHLNSNEMPEDSSTPEEMPPPEPPEPPQEAAEAEK 
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FIGURE 220 



AAAAC C C T G TAT T T T T TACAAT GCAAATAGACAATNANC C T GGAGGT C T T T GAAT TAGGTAT 
TATAGGGATGGTGGGGTTGATTTTTNTTCCTGGAGGCTTTTGGCTTTGGACTCTCNCTTTCT 
C C C AC AGAGCNC T T C GAC C AT CAC TGCCCCTGGGT GGGGAAT T G T G T T GGAAAGAGGAAC T A 
CCGCTANTTCTACCTCTTCATCCTTTNTCTCTCCCNCCTCACAATCTATGTCTTCGCCTTCA 
ACATCGT 
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F1QURE 221 



GTTGTGTCCTT C AGC AAAAC AG T GGAT T TAAAT C T C C T T GC ACAAGCT T GAGAGGAACACAA 
TCTAT C AG GAAAGAAAGAAAGAAAAAAAC C GAAC CT G ACAAAAAAGAAGAAAAAGAAGAAGA 
AAAAAAAT C ATS AAAAC CAT CCAGC CAAAAAT GCACAATT CT AT CT CT TGGGCAAT CT T CAC 
GGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTTCC 
CCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATT 
GACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGA 
CAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAA.CGCAGTACAGCATCG 
AGAT C CAGAACG T GGATG T G TATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAAC 
CAC C CAAAG AC C T C T AGGG T C CAC CT CAT T GT G CAAGT AT CT CCCAAAAT T GT AGAGAT T T C 
T T C AG AT AT C T C CAT T AAT GAAG GGAAC AATAT T AGC C T CAC C T GCAT AGCAAC T GG T AGAC 
CAGAGCCTACGGTTACTTGGAGACACATCTCTCCCAAAGCGGTTGGCTTTGTGAGTGAAGAC 
G AA T AC T T GG AAAT T C AGG G CAT CAC C C G G GAG C AG T C AGGGGACTAC GAG T GC AG T GC CT C 
CAAT GAC GT GGC CGCGCCCGT GGTAC GGAGAGTAAAGGTCACCGTGAAC TAT CCAC C ATACA 
T T T C AGAAG C C AAGGGT AC AGG T GTCCCCGT GGGACAAAAGGGGACACT GCAG T G T GAAGC C 
T C AG C AGT C C C C T C AG C AG AAT T C C AGT GG T AC AAGGAT GACAAAAGAC T GAT T GAAG GAAA 
GAAAGGGGTGAAAGTGGAAAACAGACCTTTCCTCTCAAAACTCATCTTCTTCAATGTCTCTG 
AACATGAC TATGGGAACTACACTTGCGTGGCCTCCAACAAGCTGGGCCACACCAATGCCAGC 
ATCATGCTATTTGGTCCAGGCGCCGTCAGCGAGGTGAGCAACGGCACGTCGAGGAGGGCAGG 
CTGCGTCTGGCTGCTGCCTCTTCTGGTCTTGCACCTGCTTCTCAAATTTTSATGTGAGTGCC 
AC T T C CC C AC C C G GGAAAG G C T G C C GC C AC CAC CAC CAC CAACACAACAGC AAT GGCAACAC 
CGACAGCAACCAAT CAGATATATACAAATGAAATTAGAAGAAACACAGCCTCAT GGGACAGA 
AAT T T GAGGGAGGGGAACAAAGAATACTTTGGGGGGAAAAGAGTTTTAAAAAAGAAATTGAA 
AAT T G C C T T GC AGAT AT T T AGGTACAATGGAGT T TTC TT T TCC CAAAC GGGAAGAACACAGC 
ACACCCGGCTTGGACCCACTGCAAGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAA 
GGGCTCAGCCTCTCTGCCCACAGAGTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCA 
AATTCAATCAGTCCATAGAGACGAACAGAATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGG 
GCACTTTGGTAGACTGTGCCACCACGGCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAA 
AAAAA 
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FIGURE 222 



MKTIQ PKMHN SISWAI FTG LAALC L FQGVPVRS GDAT FP KAMDNVT VR QGE SAT LRC T I DNR 

VTRVAWLNRSTILYAGNDKWCLDPRWLLSNTQTQYSIEIQNVDVYDEGFYTCSVQTDNHPK 
TSRVHLIVQVSPKIVEISSDISINEGNNISLTCIATGRPEPTVTWRHISPKAVGFVSEDEYL 
E I QG I T RE Q S G DYE C S AS ND VAAP WRRVKVT YN YP P Y I SEAKGTGVPVGQKGTLQCEASAV 

PSAEFQWYKDDKRLIEGKKGVKVENRPFLSKLIFFWSEHDYGNYTCVASNKLGHTNASIML 
FGPGAVSEVSNGTSRRAGCVWLLPLLVLHLLLKF 
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FIGURE 223 

G AAAAAAAAT CAT GAAAAC CAT C CAGC CAAAAAT G C AC AAT T C TAT C T C T T GG G C AAT C T T C 

ACGGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTT 
CCCCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTA 
TTGACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAAT 
GACAAG TGGTGCCTG GAT C CTCGCGTGGTCCTTCT GAGCAACAC C CAAACGCAGTACAGCAT 

CGAGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACA 
ACCACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATT 
T C T T C AGAT AT CTC CAT T AAT GAAGGGAACAAT ATTAGC C T CAC CT GC ATAGCAAC T GG TAG 
ACCAGAG 



• 
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FIGURE 224 



ATGGCTGGTGACGGCGGGGCCGGGCAGGGGACCGGGGCCGCGGCCCGGGAGCGGGCCAGCTG 

CCGGGAGCCCTGAATCACCGCCTGGCCCGACTCCACCMS5AACGTCGCGCTGCAGGAGCTGG 
GAG C T G G C AG C AAC G T G G GAT T C C AGAAG GGGACAAGAC AG C T G T T AGG C T C AC GCAC G GAG 

CTGGAGCTGGTCTTAGCAGGTGCCTCTCTACTGCTGGCTGCACTGCTTCTGGGCTGCCTTGT 
GGCCCTAGGGGTCCAGTACCACAGAGACCCATCCCACAGCACCTGCCTTACAGAGGCCTGCA 
TTCGAGTGGCTGGAAAAATCCTGGAGTCCCTGGACCGAGGGGTGAGCCCCTGTGAGGACTTT 
TAC CAGTTCTCCTGTGGGGGCTGGATTCGGAGGAACCCCCTGCCCGATGGGCGTTCTCGCTG 
GAAC AC C T T C AAC AG C C T C T GGG AC CAAAAC CAG GC CAT AC T G AAGCAC C T GC T T GAAAACA 
C CAC C T T CAAC T C CAGCAGT GAAG C T GAGCAGAAGACACAGC GC T T C TAC C TAT C T T GCC TA 
CAG G T G G AGC G CAT T GAG GAG C T G G GAG C C C AGC CAC T GAGAGAC C T CAT T GAG AAG AT T GG 
T G G T T G GAAC AT TAC GGGGCCCT G GGAC C AGGACAAC T T TAT G GAGG T G T T GAAGG CAG TAG 
CAGGGACCTACAGGGCCACCCCATTCTTCACCGTCTACATCAGTGCCGACTCTAAGAGTTCC 
AACAGCAATGTTATCCAGGTGGACCAGTCTGGGCTCTTTCTGCCCTCTCGGGATTACTACTT 
AAAC AG AAC T G C C AAT G AGAAAG T G C T CAC T GC C TAT C T GG AT TAC AT G GAGGAAC T GG GG A 

TGCTGCTGGGTGGGCGGCCCACCTCCACGAGGGAGCAGATGCAGCAGGTGCTGGAGTTGGAG 
AT AC AG C T G GC CAAC AT CAC AG T G C C C CAG G AC CAG CGG C GC GAC GAGGAGAAGAT C TAC C A 

CAAGATGAGCATTTCGGAGCTGCAGGCTCTGGCGCCCTCCATGGACTGGCTTGAGTTCCTGT 
CTTTCTTGCTGTCACCATTGGAGTTGAGTGACTCTGAGCCTGTGGTGGTGTATGGGATGGAT 
T A T T T G CAG C AG G T G T C AGAGC T CAT CAAC C GCAC GGAAC C AAGCAT C C T GAACAAT TAC C T 
GAT C T G GAAC C T G G T GC AAAAGACAAC C T CAAGC C T GGAC C GAC GC T T T GAGT C T GCAC AAG 
AGAAGCTGCTGGAGACCCTCTATGGCACTAAGAAGTCCTGTGTGCCGAGGTGGCAGACCTGC 
ATCTCCAACACGGATGACGCCCTTGGCTTTGCTTTGGGGTCACTCTTCGTGAAGGCCACGTT 
T GAC CGG C AAAG C AAAGAAAT T G C AGAG G GGAT GAT CAG C GAAAT C C GGAC C GCAT T T GAGG 
AGG C C C T G G GAC AG C T G G T T T GG AT GGAT GAGAAGAC C C GC C AGGC AGC C AAG GAGAAAGC A 
GAT G C CAT C TAT GAT AT GAT TGGTTTCC C AGAC T T TAT C C T GGAGC C C AAAGAG C T GGAT GA 
T GT T TAT GAC GGGTAC GAAAT T T C T GAAGAT TC T T TCT TCC AAAACAT G T TGAAT TT GTACA 

ACTTCTCTGCCAAGGTTATGGCTGACCAGCTCCGCAAGCCTCCCAGCCGAGACCAGTGGAGC 
AT GAC C C C C C AGAC AG T G AAT GCC TAC TAC C TT C CAAC T AAGAAT GAGAT CGTCTTCCCCGC 

TGGCATCCTGCAGGCCCCCTTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCA 
TCGGTGTGGTCATGGGCCATGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGAC 
AAAGAAGGGAACCTGCGGCCCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACAC 
GG C C T G CAT G GAGG AAC AG T ACAAT CAAT AC C AGG T CAAT GGGGAGAG G C T CAAC GGC C G C C 

AGAC GC T G GGGGAGAAC AT TAC TGACAACGGGGGGCTGAAGGCTGCCTACAATGCTTACAAA 
GCATGGCTGAGAAAGCATGGGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACCAACCACCA 
GCTCTTCTTCGTGGGATTTGCCCAGGTGTGGTGCTCGGTCCGCACACCAGAGAGCTCTCACG 
AGGGGCTGGTGACCGACCCCCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAAC 
TCCCGTGACTTCCTGCGGCACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCT 
GTGTGAGGTGTGGTASACCTGGATCAGGGGAGAAATGGCCAGCTGTCACCAGACCTGGGGCA 
GCTCTCCTGACAAAGCTGTTTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGT 
C TAG TCCCTCCCCCC CAC AG G T GAC AT GAGT ACAGACC C T C C T CAAT CAC C ACAT T GT GC CT 

CTGCTTTGGGGGTGCCCCTGCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGT 
GTCACCCTGCCTGGAAGAGGTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCC 
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FIGURE 225 



MIWALQELGAGSNVGFQKGTRQLLGSRTQLELVXAGASLLIAM.LLGCLVALGVQYHRDPSK 
STCLTEACIRVAGKILESLDRGVSPCEDFYQFSCGGWIRRNPLPDGRSRWNTFNSLWDQNQA 
ILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQPLRDLIEKIGGWNITGPWDQDN 
mEVLKAVAGTYRATPFFTWISADSKSSNSNVIQVDQSGLFLPSRDYYLNRTANEKVLTAY 
LDYMEELGMLLGGRPTSTREQMQQVLELEIQLANITVPQDQRRDEEKIYHKMSISELQALAP 
SMDWLEFLSFLLSPLELSDSEPVWYGMDYLQQVSELINRTEPSILNNYLIWNLVQKTTSSL 
DRRFESAQEKLLETLYGTKKSCVPRWQTCISNTDDALGFALGSLFVKATFDRQSKEIAEGMI 
SEIRTAFEEALGQLVl^DEKTRQAJ^KADAIYDMIGFPDFILEPKELDDVYDGYEISEDSF 
FQNMLNL YN FS AKVMADQLRKP P SRDQWSMT PQT VNAYYLP TKNE I VFPAG I LQAP FYARNH 
PKALNFGGIGVVMGHELTHAFDDQGREYDKEGNLRPWWQ 

NGERLNGRQTLGENITDNGGLKAAYNAYKAWLRKHGEEQQLPAVGLTNHQLFFVGFAQWCS 
VRTPESSHEGLVTDPHSPARFRVLGTLSNSRDFLRHFGCPVGSPMNPGQLCEVW 
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FIGURE 225A 

GCCCGGCCCTCCGCCCTCCGCACTCCCGCCTCCCTCCCTCCGCCCGCTCCCGCGCCCTCCTC 
CCTCCCTCCTCCCCAGCTGTCCCGTTCGCGTCATGCCGAGCCTCCCGGCCCCGCCGGCCCCG 
CTGCTGCTCCTCGGGCTGCTGCTGCTCGGCTCCCGGCCGGCCCGCGGCGCCGGCCCAGAGCC 
CCCCGTGCTGCCCATCCGTTCTGAGAAGGAGCCGCTGCCCGTTCGGGGAGCGGCAGGTAGGT 
GGGCGCCCGGGGGAGGCGCGGGCGGGGAGTCGGGCTCGGGGCGAGTCAGCGCCAGCCCGGAG 
GGGGCGCGGGGCGCAGGTGGCTCGGCGCGGCGGGCGGCCCGGAGGGTGGGCGGGGGCAGAAG 
GGCGCGGTGCCTGGGACCGGGGACCCGCGGGCAGCCCCCGGGGCGGCACACGGCGCGAGCTG 
GGCAGCGGCCTCCAGCCAAGCCCGTCCCCGCAGGCTGCACCTTCGGCGGGAAGGTCTATGCC 
TTGGACGAGACGTGGCACCCGGACCTAGGGGAGCCATTCGGGGTGATGCGCTGCGTGCTGTG 
CGCCTGCGAGGCGCAGTGGGGTCGCCGTACCAGGGGCCCTGGCAGGGTCAGCTGCAAGAACA 
TCAAACCAGAGTGCCCAACCCCGGCCTGTGGGCAGCCGCGCCAGCTGCCGGGACACTGCTGC 
CAGACCTGCCCCCAGGACTTCGTGGCGCTGCTGACAGGGCCGAGGTCGCAGGCGGTGGCACG 
AGCCCGAGTCTCGCTGCTGCGCTCTAGCCTCCGCTTCTCTATCTCCTACAGGCGGCTGGACC 
GCCCTACCAGGATCCGCTTCTCAGACTCCAATGGCAGTGTCCTGTTTGAGCACCCTGCAGCC 
CCCACCCAAGATGGCCTGGTCTGTGGGGTGTGGCGGGCAGTGCCTCGGTTGTCTCTGCGGCT 
CCTTAGGGCAGAACAGCTGCATGTGGCACTTGTGACACTCACTCACCCTTCAGGGGAGGTCT 
GGGGGCCTCTCATCCGGCACCGGGCCCTGTCCCCAGAGACCTTCAGTGCCATCCTGACTCTA 
GAAGGCCCCCACCAGCAGGGCGTAGGGGGCATCACCCTGCTCACTCTCAGTGACACAGAGGA 
CTCCTTGCATTTTTTGCTGCTCTTCCGAGGCCTTGCAGGACTAACCCAGGTTCCCTTGAGGC 
T C C AG AT T C T AC AC C AG G G G C AG C TAG T GC GAGAAC T T C AGGC C AAT G T C T C AG C C CAGGAA 
CCAGGCTTTGCTGAGGTGCTGCCCAACCTGACAGTCCAGGAGATGGACTGGCTGGTGCTGGG 
GGAGCTGCAGATGGCCCTGGAGTGGGCAGGCAGGCCAGGGCTGCGCATCAGTGGACACATTG 
CTGCCAGGAAGAGCTGCGACGTCCTGCAAAGTGTCCTTTGTGGGGCTAATGCCCTGATCCCA 
GTCCAAACGGGTGCTGCCGGCTCAGCCAGCCTCACTCTGCTAGGAAATGGCNCCCTGATCCT 
CCAGGTGCAATTGGTAGGGACAACCAGTGAGGTGGTGGCCATGACACTGGAAACCAAGCCTC 
AGCGGAGGGATCAGCCCACTGTCCTGTGCCACATGGCTGGCCTATCCTCCCCTGCCCCCAGG 
CCGTGGGTATCTGCCCTGGGCTGGGGTGCCCGAGGGGCTCATATGCTGCTGCAGAATGAGCT 
CTTCCTGAACGTGGGCACCAAGGACTTCCCAGACGGAGAGCTTCGGGGGCAACGTGGCTGCC 
CTGCCCTACTGTGGGGCATAGCGCCCGCCCTGCCCGTGCCCCTAGCAGGAGCCCTGGTGCTA 
CCCCCTGTGAAGAGCCAAGCAGCAGGGCACGCCTGGCTTTCCTTGGATACCCACTGTCACCT 
GCACTATGAAGTGCTGCTGGCTGGGCTTGGTGGCTCAGAACAAGGCACTGTCACTGCCCACC 
TCCTTGGGCCTCCTGGAACGCCAGGGCCTCGGCGGCTGCTGAAGGGATTCTATGGCTCAGAG 
GCCCAGGGTGTGGTGAAGGACCTGGAGCCGGAACTGCTGCGGCACCTGGCAAAAGGCATGGC 
TTCCCTGATGATCACCACCAAGGTAGCCCCAGAGGGGAGCTCCGAGGGCAGCCTCTCCTCCC 
AGGTGCACATAGCCAACCAATGTGAGGTTGGCGGACTGCGCCTGGAGGCGGCCGGGGCCGAG 
GGGGTGCGGGCGCTGGGGGCTCCGGATACAGCCTCTGCTGCGCCGCCTGTGGTGCCTGGTCT 
CCCGGCCCTAGCGCCCGCCAAACCTGGTGGTCCTGGGCGGCCCCGAGACCCCAACACATGCT 
TCTTCGAGGGGCAGCAGCGCCCCCACGGGGCTCGGTGGGCGCCCAACTACGACCCGCTCTGC 
TCACTCTGCACCTGCCAGAGACGAACGGTGATCTGTGACCCGGTGGTGTGCCCACCGCCCAG 
CTGCCCACACCCGGTGCAGGCTCCCGACCAGTGCTGCCCTGTTTGCCCTGGCTGCTATTTTG 
ATGGTGACCGGAGCTGGCGGGCAGCGGGTACGCGGTGGCACCCCGTTGTGCCCCCCTTTGGC 
T T AAT T AAGT GTGCTGTCTG C AC CT GCAAGCAGGGGGGC AC T G GAGAGGT GC AC T GT GAGAA 
GGTGCAGTGTCCCCGGCTGGCCTGTGCCCAGCCTGTGCGTGTCAACCCCACCGACTGCTGCA 
AACAGTGTCCAGGTGAGGCCCACCCCCAGCTGGGGGACCCCATGCAGGCTGATGGGCCCCGG 
GGCTGCCGTTTTGCTGGGCAGTGGTf CCCAGAGAGTCAGAGCTGGCACCCCTCAGTGCCCCC 
GTTTGGAGAGATGAGCTGTATCACCTGCAGATGTGGGGTAAGTGGGGAGCAGAGGCTTGTGT 
G AGG T G GG T AC T GG GAGC C T G G T C T GG AG TAG GGAGAC C T T C C CAGGGAGG T C C CT GAAGAA 
GCTGAAGGTCACTGTGTCCCAGTGCCTCTGGGGGACACTCAGTGTCTGCTCTGTCTTGTACC 
AGGCAGGGGTGCCTCACTGTGAGCGGGATGACTGTTCACTGCCACTGTCCTGTGGCTCGGGG 
AAGGAGAGTCGATGCTGTTCCCGCTGCACGGCCCACCGGCGGCGTAAGTGAGGGAGTCCAGG 
GT C AG C AG C T GT GAG T GGAG GGC T CAC CTGCCTGT GGGAC T CC T GAT CAGGGAAGGGAGCAC 
TCACTGTGTGCAGGAACAGTGCAGCCTGCCTCACAAGTGCCATTCCAATCCACCCTCACAGC 
AACCTGGTGGAATTGTTATTTATGACCTTTTCTTTACAAATGAGATTTCTGAAGCTCAGAGA 
AATTAAGCAACGAGAT GAAG G T CAC C CAGC T G T G T GCAC T GAC C T GT T TAGAAAATACTGGC 
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CTTTCTGGGACCAAGGCAGGGATGCTTTGCCCTGCCCTCTATGCCTCTCTGTGCCTCTCCAC 
TCCCTCTCCCCTCCTCCAACATTCCCTCCCTTCTGTCTCCAGCAGCCCCAGAGACCAGAACT 
GAT C C AGAG C T GGAGAAAGAAGCCGAAGGC T C T TAGGGAGCAGC CAGAGGGCCAAGT GAC CA 

AGAGGATGGGGCCTGAGCTGGGGAAGGGGTGGCATCGAGGACCTTCTTGCATTCTCCTGTGG 
GAAGCCCAGTGCCTTTGCTCCTCTGTCCTGCCTCTACTCCCACCCCCACTACCTCTGGGAAC 
CAC AG C T C C AC AAG G G G G AGAGGCAGC T G GG C CAGAC C GAGG T C ACAGC CAC T C C AAG T C C T 

GCCCTGCCACCCTCGGCCTCTGTCCTGGAAGCCCCACCCCTTTCTTCCTGTACATAATGTCA 
CTGGCTTGTTGGGATTTTTAATTTATCTTCACTCAGCACCAAGGGCCCCGGACACTCCACTC 
CTGCTGCCCCTGAGCTGAGCAGAGTCATTATTGGAGAGTTTTGTATTTATTAAAACATTTCT 
TTTTCAGTCTTTGGGCATGAGGTTGGCTCTTTGTGGCCAGGAACCTGAGTGGGGCCTGGTGG 
AGAAGGGGCNGAGAG TAGGAGGT GAGAGAGAGGAGCTC TGACAC T T GGGGAGC TGAAAGAGA 
CCTGGAGAGGCAGAGGATAGCGTGGCNNTTGGCTGGCATNCCTGGGTTCCGCAGAGGGGCTG 
GGGAT GGT T C TTGAGAT GGTCTAGAGACTCAAGAATTTAGGGAAGTAGAAGCAGGAT TTTGA 
CTCAAGTTTAGTTTCCCACATCGCTGGCCTGTTTGCTGACTTCATGTTTGAAGTTGCTCCAG 
AGAGAGAATCAAAGGTGTCACCAGCCCCTCTCTCCCTCCTTCCCTTCCCTTCCCTTTCTTTC 
CCTCCCCTCCCCTCCCCTCCCCTCCCCTCC 
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GGCCGAGCGGGGGTGCTGCGCGGCGGCCGTGATGGCTGGTGACGGCGGGGCCGGGCAGGGGA 

CCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGAATCACCGCCTGGCCCGAC 
T C CAC CAT GAAC GTCGCGCT GC AGGAGCT GGGAG C T GGCAGCAAC GT GGGAT T C CAGAAGGG 

GAG AAG AC AG C T GT T AGG C T CAC GC AC GC AGC T GGAG C T GGT CT T AGC AGGT GC C T CT CTAC 
TGCTGGCTGCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGACCCA 
TCCCACAGCACCTGCCTTACAGAGGCCTGCATTCGAGTGGCTGGAAAAATCCTGGAGTCCCT 
GGACCGAGGGGTGAGCCCCTGTGAGGACTTTTACCAGTTCTCCTGTGGGGGCTGGATTCGGA 
GGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGCCTCTGGGACCAAAAC 
C AGGC CAT AC T G AAG CAC C T GC T T GAAAAC AC CAC C T T CAAC T C CAGC AG T GAAG C T G AGC A 
GAAGACAC AGC GC T T C TAC C TATCTTGCC TACAGGT GGAGCGCATTGAGGAGC T GGGAGCC C 
AGC CAC T GAGAGAC C T CAT T GAGAAGATT GGT GGT T GGAACAT TAC GGGGC CC T GGGACCAG 
G AC AAC T T TAT G G AGG T G T T GAAGGC AG TAG C AGGGAC C TAC AG G GC C AC C CC AT T C T T CAC 
CGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAGCAATGTTATCCAGGTGGACCAGTCTG 
GGC TCTTTCTGCCCTCTC GGGAT TAC TAC T TAAACAGAACTGCCAATGAGAAAGTAAGGAAC 
ATCTTCCGAACCCCCATCCCTACCCCTGGCTGAGCTGGGCTGATCCCTGTTGACTTTTCCCT 
TTGCCAAGGGTCAGAGCAGGGAAGGTGAGCCTATCCTGTCACCTAGTGAACAAACTGCCCCT 
CCTTTCTTTCTTCTTTTCTTCCTCCCTCCCTCCCTTTCTTCCCCTTTTCCTTCCTTCCTTCC 
TCTTATTCTTCTAGTAGGTTTCATAGACACCTACTGTGTGCCAGGTCCAGTGGGGGAATTCG 
GAGAT AT AAG T T T C C GAG C CAT T G C CAC AG G AAGC G T T CAG T G T C GAT GGG T TCAT GGAC C T 
AGAT AG G C T GAT AAC AAAG C T C ACAAGAGGGT C C T GAGGAT T CAGGAGAGAC T T ATGGAGC C 
AGCAAAG T C T T C C T GAAGAGAT T GCAT TT GAGC CAGG T C C T G TAG 
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ATGCCTACTACCTTCCAACTAAGAATGAGATCGTCTTCCCCGCTGGCATCCTGCAGGCCCCC 
TTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATCGGTGTGGTCATGGGCCA 
TGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAACCTGCGGC 
C CT GGT GGCAGAAT GAG TC C CT GGCAGCCT TCCGGAACCACACGGCCTGCA.TGGAGGAACAG 
TAC AAT C AAT AC C AG GT C AAT GGGGAGAGGCT CAAC GGC CG C C AGAC G C T GGGGGAGAACAT 
T GC T G ACAAC GGGGGGCT GAAGGC T GC CT ACAAT GCT TACAAAGCAT G GC T GAGAAAGCAT G 
GG GAG G AGCAG CAAC T G C C AG CCGTGGGGCT C AC CAAC CAC CAGC T CT T C T T C G T GGGAT T T 
GC C CAGG T GT GGT GCT C GGT CC GCACACCAGAGAGCTCTCACGAGGGGCTGGT GAC CGAC C C 
CCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGGC 
ACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACC 
TGGATCAGGGGAGAAATGGCCAGCTGTCACCAGACCTGGGGCAGCTCTCCTGACAAAGCTGT 
TTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCCACA 
GGTGACATGAGTACAGACCCTCCTCAATCACCACATTGTGCCTCTGCTTTGGGGGTGCCCCT 
GCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCTGCCTGGAAGAG 
GTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCCTCTTCTGTCCCCAGGCTCACT 
CAGCCTGGCGGCCATGGGGCCTGCCGTGCCTGCCCCACTGTGACCCACAGGCCTGGGTGGTG 
TACCTCCTGGACTTCTCCCCAGGCTCACTCAGTGCGCACTTAGGGGTGGACTCAGCTCTGTC 
TGGCTCACCCTCACGGGCTACCCCCACCTCACCCTGTGCTCCTTGTGCCACTGCTCCCAGTG 
CTGCTGCTGACCTTCACTGACAGCTCCTAGTGGAAGCCCAAGGGCCTCTGAAAGCCTCCTGC 
TGCCCACTGTTTCCCTGGGCTGAGAGGGGAAGTGCATATGTGTAGCGGGTACTGGTTCCTGT 
G TC T T AGG G C AC AAG C C T T AGC AAATGAT TGAT T C T CCC T GGACAAAGCAGGAAAG CAGATA 
GAG C AGG GAAAAGGAAGAAC AGAG T T T AT TT T TACAGAAAAGAGGG T GGGAGG GT GT GGT C T 
T GG C C C T TATAGG AC C 
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